INDEX
Explanations
adjectives related to negative emotions and medical conditions
terms related to various states of emotional or physical distress and vulnerability
New Auto-Interp
Negative Logits
Downloadha
-0.91
sidx
-0.79
76561
-0.78
ickr
-0.73
ioxide
-0.68
abwe
-0.66
ATT
-0.64
arty
-0.61
aeda
-0.61
è£ħ
-0.60
POSITIVE LOGITS
ness
3.27
nesses
2.67
NESS
2.12
ity
1.49
liness
1.48
ening
1.45
ly
1.44
itude
1.35
ened
1.24
cies
1.20
Activations Density 0.142%