Explanations
the word "habit" or phrases related to habits
references to behavioral habits
Negative Logits
zac      -0.79
abad     -0.78
cross    -0.71
NZ       -0.66
sie      -0.66
pta      -0.66
RIS      -0.65
aucus    -0.64
SAR      -0.64
ndum     -0.63
Positive Logits
ually    1.16
habits   1.12
uated    1.04
uation   1.03
habit    0.93
uate     0.86
hered    0.74
uating   0.72
ality    0.72
iliar    0.72
Activations Density: 0.046%
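
The page does not say how the density figure is computed. A common reading is that it is the fraction of sampled tokens on which the feature's activation is nonzero. The sketch below assumes that definition; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and chosen only so the arithmetic is easy to follow.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which
    the feature fires at all. The source page does not confirm this.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data (not taken from the page):
# 46 active tokens out of 100,000 sampled tokens.
toy_activations = [0.0] * 99_954 + [1.5] * 46
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.046% means the feature is sparse: it fires on roughly 46 of every 100,000 tokens, which fits a feature tied to a narrow topic such as "habit".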