INDEX
Explanations
mentions of organized public demonstrations or protests
references to specific dates and events related to marches
New Auto-Interp
Negative Logits
etheless
-0.74
terness
-0.74
xus
-0.70
eger
-0.70
ridges
-0.68
flix
-0.66
levant
-0.66
terrestrial
-0.66
oxide
-0.64
juicy
-0.64
POSITIVE LOGITS
marching
0.99
marched
0.92
marches
0.88
march
0.87
iage
0.86
cade
0.84
oppable
0.79
itzer
0.78
aging
0.76
antry
0.76
Activations Density 0.019%