INDEX
Explanations
expressions indicating interest or involvement in various activities or subjects
Negative Logits
üstü     -0.07
etak     -0.06
odash    -0.06
aras     -0.06
Masc     -0.06
ulas     -0.06
è±       -0.06
ustil    -0.06
æ        -0.06
unts     -0.06
Positive Logits
habit    0.09
trouble  0.08
organ    0.08
leigh    0.08
abit     0.07
Trouble  0.07
ymm      0.07
habits   0.07
ONTAL    0.07
mess     0.07
Activations Density: 0.015%