INDEX
Explanations
phrases indicating relocation or movement to a new place
New Auto-Interp
Negative Logits
arya
-0.17
aub
-0.17
jid
-0.16
Schwe
-0.15
atas
-0.15
ysl
-0.15
jang
-0.14
usz
-0.13
umed
-0.13
osy
-0.13
POSITIVE LOGITS
egree
0.17
çĴĥ
0.14
443
0.14
eness
0.14
accent
0.14
Punch
0.14
ilis
0.13
zan
0.13
azor
0.13
Wolff
0.13
Activations Density 0.016%