INDEX
Explanations
mentions of geographical locations and proper nouns
Place names (Sierra, Ang, San, Braz, Belo, Santa, New)
New Auto-Interp
Negative Logits
rån
-0.55
houver
-0.52
vulgaires
-0.51
ministerium
-0.51
щихся
-0.49
تومان
-0.47
obvious
-0.47
étrangères
-0.46
utnik
-0.46
suivantes
-0.46
POSITIVE LOGITS
tvguidetime
0.85
Efq
0.84
]='\
0.82
UnusedPrivate
0.80
########.
0.71
utafitiHapana
0.70
Monfieur
0.70
GEBURTS
0.68
0.68
myſelf
0.66
Activations Density 0.331%