INDEX
Explanations
words related to political movements and affiliations
New Auto-Interp
Negative Logits
tizado
-0.35
dolay
-0.33
ISO
-0.33
Româ
-0.32
Buen
-0.31
крой
-0.31
sprechend
-0.30
ⓘ
-0.30
iso
-0.30
gamis
-0.30
POSITIVE LOGITS
ists
1.02
ISTS
0.95
ista
0.89
istas
0.89
IST
0.89
iste
0.86
ist
0.86
ister
0.85
istes
0.85
isted
0.83
Activations Density 1.710%