INDEX
Explanations
locations and geographical names
New Auto-Interp
Negative Logits
ibili
-0.17
ahir
-0.17
asca
-0.16
ogg
-0.16
sip
-0.15
adoo
-0.15
andler
-0.14
iT
-0.14
asad
-0.14
mate
-0.14
POSITIVE LOGITS
Kore
0.16
ilos
0.14
EFR
0.14
rzy
0.13
stem
0.13
Dickinson
0.13
ãĤ¤ãĤº
0.13
rans
0.13
ros
0.13
ë¦Ħ
0.13
Activations Density 0.076%