INDEX
Explanations
references to habits and lifestyle choices
Negative Logits
ream       -0.13
pool       -0.13
hash       -0.13
appiness   -0.13
puss       -0.13
ụ          -0.13
rength     -0.13
lg         -0.13
rowning    -0.13
Knox       -0.12
Positive Logits
habit      0.79
habits     0.79
Hab        0.77
Habit      0.66
hab        0.66
hab        0.63
habit      0.62
习         0.58
習         0.54
habitual   0.49
Activations Density 0.177%