INDEX
Explanations
words related to rejection or being uninterested
phrases indicating opposition or dissent
New Auto-Interp
Negative Logits
raq
-0.79
soType
-0.76
ased
-0.68
è¦ļéĨĴ
-0.68
forcement
-0.67
CVE
-0.64
racuse
-0.63
aptic
-0.63
azard
-0.63
=>
-0.61
POSITIVE LOGITS
necessarily
0.95
shy
0.85
anymore
0.84
earthly
0.81
conventional
0.80
whatsoever
0.80
any
0.80
anything
0.80
altogether
0.75
ever
0.75
Activations Density 0.564%