INDEX
Explanations
terms related to living or working in foreign countries
New Auto-Interp
Negative Logits
oku
-0.18
Zuk
-0.17
à¸Ļ
-0.17
ruz
-0.15
inda
-0.15
kara
-0.14
ÏĢο
-0.14
dings
-0.14
alogy
-0.14
et
-0.14
POSITIVE LOGITS
jÅ¡ÃŃ
0.16
782
0.16
ollar
0.15
azer
0.14
tog
0.14
SIDE
0.14
館
0.14
Ã
0.14
Vars
0.14
üssen
0.13
Activations Density 0.006%