INDEX
Explanations
phrases indicating a strong negative sentiment or denial
negations or statements indicating the absence of something
New Auto-Interp
Negative Logits
rn
-0.73
Ott
-0.69
ean
-0.68
Appears
-0.67
lish
-0.67
eals
-0.66
lean
-0.65
otton
-0.64
leaning
-0.64
staking
-0.64
POSITIVE LOGITS
shortage
1.13
doubt
1.12
indication
1.05
reason
0.98
guarantee
0.96
discern
0.95
conceivable
0.92
xious
0.91
longer
0.91
denying
0.90
Activations Density 0.061%