INDEX
Explanations
words related to sparking intense reactions, debates, and movements
phrases that indicate the initiation of discussions or reactions to significant events or issues
New Auto-Interp
Negative Logits
Anat
-0.76
undone
-0.70
ACTED
-0.69
Definitive
-0.67
potatoes
-0.67
disapp
-0.65
implements
-0.64
peeled
-0.64
flats
-0.64
repealed
-0.62
POSITIVE LOGITS
inki
0.88
ire
0.79
riott
0.73
disbelief
0.72
wrath
0.72
Demand
0.71
aign
0.70
downfall
0.69
frenzy
0.69
ependence
0.68
Activations Density 0.283%