INDEX
Explanations
names and terms related to political figures
words with "ar" or "un" in various forms
New Auto-Interp
Negative Logits
nesday
-0.76
ancial
-0.68
ankind
-0.67
rongh
-0.65
ascus
-0.65
glers
-0.63
wcs
-0.63
etheless
-0.59
sake
-0.59
awed
-0.58
POSITIVE LOGITS
wine
0.70
itect
0.70
inian
0.69
aston
0.65
agos
0.64
tes
0.63
ondo
0.62
ICT
0.62
Rah
0.62
omaly
0.62
Activations Density 0.069%