INDEX
Explanations
references to specific political figures and entities
references to significant events and figures related to political resistance and protests
New Auto-Interp
Negative Logits
phal
-0.84
ties
-0.80
loving
-0.80
lishes
-0.78
marine
-0.77
ty
-0.76
Spit
-0.75
friend
-0.73
win
-0.73
trap
-0.71
POSITIVE LOGITS
Lerner
0.77
Standing
0.75
imester
0.74
Garland
0.72
orsi
0.71
hearings
0.70
iffs
0.70
ón
0.69
ograp
0.68
pollen
0.68
Activations Density 0.019%