INDEX
Explanations
mentions of high-stress situations, conflicts, and challenges
phrases related to social and political movements
New Auto-Interp
Negative Logits
aka
-0.62
arta
-0.61
hangs
-0.60
onga
-0.59
Allaah
-0.58
awa
-0.58
onto
-0.57
osa
-0.57
anges
-0.56
Sketch
-0.56
POSITIVE LOGITS
Ples
0.65
last
0.65
enthusi
0.64
outcry
0.63
yesterday
0.62
initially
0.61
applause
0.59
pandemonium
0.59
uproar
0.59
Thiel
0.58
Activations Density 1.855%