INDEX
Explanations
terms related to provoking emotions or stirring up reactions
themes related to provocation or incitement
New Auto-Interp
Negative Logits
WATCHED
-0.73
esan
-0.69
route
-0.68
rendered
-0.67
handle
-0.64
uters
-0.63
ians
-0.63
é¾įå
-0.62
çĦ
-0.61
Controls
-0.61
POSITIVE LOGITS
havoc
1.01
enthusiasm
0.88
controversy
0.84
excitement
0.83
curiosity
0.81
fires
0.80
frenzy
0.80
Development
0.80
laughter
0.79
pandemonium
0.77
Activations Density 0.108%