INDEX
Explanations
phrases related to feelings of hopelessness
New Auto-Interp
Negative Logits
APH
-0.62
chedel
-0.61
anwhile
-0.61
ickr
-0.60
soType
-0.59
appa
-0.57
Flavoring
-0.57
ipers
-0.57
xon
-0.57
edia
-0.56
POSITIVE LOGITS
ness
1.11
nesses
1.00
ly
0.79
fall
0.75
helpless
0.74
NESS
0.74
miser
0.74
ingly
0.69
hopeless
0.68
dest
0.60
Activations Density 11.013%