INDEX
Explanations
words related to negative emotions or attitudes towards a person or situation
negative sentiments and expressions of disappointment or disgust
New Auto-Interp
Negative Logits
glers
-0.72
cycle
-0.71
Transition
-0.69
hike
-0.69
sealing
-0.68
hiking
-0.68
rotation
-0.68
density
-0.67
playback
-0.66
flow
-0.65
POSITIVE LOGITS
ceptive
1.26
ignant
1.24
atisf
1.19
interested
1.16
otent
1.15
respect
1.12
urious
1.12
isive
1.10
assion
1.08
righteous
1.07
Activations Density 0.132%