INDEX
Explanations
contractions where 'not' is followed by a word
the word "shouldn't" and its variations, indicating expressions of advisement or critique
New Auto-Interp
Negative Logits
enthusi
-0.80
Powered
-0.70
locating
-0.69
withd
-0.68
encount
-0.68
gobl
-0.66
paran
-0.63
Herz
-0.63
fulfilled
-0.62
bombed
-0.62
POSITIVE LOGITS
't
1.58
ned
1.00
n
1.00
ny
0.99
ighed
0.90
nt
0.88
ÃŃ
0.86
no
0.83
ouch
0.82
ning
0.81
Activations Density 0.017%