INDEX
Explanations
words related to intense negative emotions like being appalled, disgusted, or horrified
New Auto-Interp
Negative Logits
-+
-0.74
Track
-0.64
main
-0.63
assets
-0.62
Status
-0.62
raft
-0.61
SS
-0.61
key
-0.61
commodity
-0.61
worth
-0.61
POSITIVE LOGITS
appalled
2.86
horrified
2.81
disgusted
2.62
outraged
2.56
baffled
2.54
alarmed
2.51
angered
2.44
offended
2.42
astonished
2.40
puzzled
2.32
Activations Density 0.043%