Explanations
words related to negation, uncertainty, or questioning
the negative contraction "don't."
Negative Logits
Token        Logit
Dickinson    -0.75
Compass      -0.65
IAS          -0.64
Reborn       -0.60
Maiden       -0.60
Starts       -0.58
states       -0.58
tips         -0.58
ISON         -0.57
Pike         -0.56
Positive Logits
Token           Logit
bother          1.04
necessarily     1.03
deserve         0.99
discriminate    0.99
hesitate        0.98
¨               0.98
Í               0.98
care            0.96
intend          0.96
CARE            0.96
Activations Density 0.098%