INDEX
Explanations
words related to negative emotions or situations, particularly those involving distress
words related to distress or discomfort
New Auto-Interp
Negative Logits
Knights
-0.66
Leaf
-0.65
Gob
-0.64
Bec
-0.64
elig
-0.63
Become
-0.59
Legion
-0.58
Emerald
-0.57
maker
-0.57
Oath
-0.56
POSITIVE LOGITS
ressing
4.35
ressed
2.47
ress
2.08
ression
2.02
resses
1.88
ressive
1.71
ressor
1.56
orting
1.21
acting
1.05
pressing
1.04
Activations Density 0.009%