INDEX
Explanations
words or phrases that start with a hyphen, specifically focusing on negative sentiment or consequences
phrases or concepts related to negative or detrimental outcomes
New Auto-Interp
Negative Logits
ÏĤ
-0.70
Doodle
-0.69
estern
-0.67
Loll
-0.67
Vaughn
-0.66
Warn
-0.64
âĶľ
-0.64
Cumm
-0.63
Flores
-0.63
HCR
-0.62
POSITIVE LOGITS
based
1.14
of
1.10
heavy
0.96
operated
0.96
to
0.94
mounted
0.93
pain
0.92
time
0.91
level
0.91
tested
0.90
Activations Density 0.110%