INDEX
Explanations
issues or events causing controversy or uproar
phrases indicating significant public reactions or consequences
New Auto-Interp
Negative Logits
alan
-0.71
uthor
-0.67
alon
-0.67
acons
-0.67
enhagen
-0.66
aiden
-0.66
track
-0.65
essor
-0.65
orest
-0.65
anwhile
-0.64
POSITIVE LOGITS
havoc
1.33
disturbance
1.04
headaches
1.00
uproar
0.98
mayhem
0.96
stir
0.94
damage
0.89
unnecessary
0.89
panic
0.89
disruptions
0.88
Activations Density 0.189%