INDEX
Explanations
phrases indicating impossibility
phrases indicating impossibility or lack of options
New Auto-Interp
Negative Logits
streng
-0.81
rongh
-0.76
asts
-0.71
ahime
-0.69
xon
-0.69
zbollah
-0.67
inion
-0.67
eals
-0.67
notations
-0.67
gments
-0.65
POSITIVE LOGITS
whatsoever
1.12
anybody
1.06
anyone
1.01
around
0.91
else
0.86
anymore
0.84
anywhere
0.82
anything
0.80
somebody
0.77
you
0.77
Activations Density 0.060%