INDEX
Explanations
political news and statements from politicians
Negative Logits
ioch     -0.63
ihar     -0.63
suggest  -0.61
bec      -0.60
Zup      -0.59
puff     -0.57
ides     -0.56
educ     -0.55
bley     -0.55
gur      -0.55
Positive Logits
art         0.78
affairs     0.70
azeera      0.68
Decay       0.65
senal       0.62
resp        0.61
lict        0.61
entreprene  0.60
matter      0.59
world       0.57
Activations Density: 0.028%