INDEX
Explanations
specific names and terms related to politics and social issues
Negative Logits
elier          -0.16
owy            -0.15
Fundamental    -0.15
гл             -0.14
ost            -0.14
igli           -0.14
avec           -0.14
NW             -0.14
apsed          -0.13
assic          -0.13
Positive Logits
uffs           0.14
chie           0.14
izona          0.14
lesia          0.14
chalk          0.14
son            0.14
juana          0.13
amera          0.13
iller          0.13
č↵č↵           0.13
Activations Density: 0.018%