Explanations
instances of the word "из" (meaning "from" or "of" in Russian)
Negative Logits
iveau    -0.16
ade      -0.16
frei     -0.16
ROTO     -0.16
زة       -0.15
ulle     -0.15
ται      -0.15
قل       -0.14
stří     -0.14
)))),    -0.14
Positive Logits
rael     0.20
quierda  0.19
abela    0.18
-за      0.18
gon      0.18
ogen     0.15
ilian    0.15
gie      0.15
g        0.15
source   0.14
Activations Density 0.004%