INDEX
Explanations
words related to negative judgments or evaluations
expressions of negative judgments or sentiments, particularly the word "terrible."
New Auto-Interp
Negative Logits
ership
-0.86
pai
-0.86
irs
-0.85
ilus
-0.80
aver
-0.75
eters
-0.73
cript
-0.72
paio
-0.72
izen
-0.71
gat
-0.70
POSITIVE LOGITS
sounding
0.81
havoc
0.78
NESS
0.77
awful
0.77
nightmares
0.76
horrible
0.75
headache
0.74
nightmare
0.73
adolesc
0.72
ordeal
0.72
Activations Density 0.022%