Explanations
information related to geographical locations or specific places
Negative Logits
koji          -0.28
italiano      -0.26
stesso        -0.25
electrónico   -0.24
completo      -0.22
himself       -0.20
який          -0.19
français      -0.19
público       -0.19
نفسه          -0.18
Positive Logits
herself       0.40
αυτή          0.30
italiana      0.29
gratuita      0.27
pública       0.27
стала         0.26
latina        0.25
могла         0.25
которая       0.24
misma         0.24
Activations Density 0.431%