INDEX
Explanations
names of individuals or characters
New Auto-Interp
Negative Logits
å±±å¸Ĥ
-0.16
Vak
-0.14
Kapoor
-0.14
Ïħν
-0.14
azine
-0.14
vyh
-0.13
*sp
-0.13
dostat
-0.13
oje
-0.13
rollable
-0.13
POSITIVE LOGITS
-san
0.16
onis
0.16
šek
0.14
ect
0.14
—who
0.14
ÑģилÑĮ
0.13
Whoever
0.13
istor
0.13
aval
0.13
EEK
0.13
Activations Density 0.068%