INDEX
Explanations
references to significant political events and their implications
Negative Logits
adro      -0.19
resa      -0.16
cura      -0.15
ottes     -0.15
antibiot  -0.15
adesh     -0.15
taire     -0.14
opal      -0.14
#ab       -0.14
iffe      -0.13
Positive Logits
Capitol   0.36
riot      0.36
ins       0.32
Jan       0.31
Riot      0.30
riots     0.29
January   0.29
MAG       0.29
mob       0.28
Proud     0.27
Activations Density: 0.028%