INDEX
Explanations
contractions with negations
phrases expressing uncertainty or lack of knowledge
Negative Logits
misdem         -0.66
Agency         -0.66
afore          -0.64
)=(            -0.64
Passage        -0.63
awaited        -0.63
ANG            -0.61
elimination    -0.60
Passing        -0.60
upp            -0.58
Positive Logits
't             1.49
ned            1.05
ates           0.88
essee          0.86
uts            0.85
ÃŃ             0.84
nas            0.82
etsk           0.80
nis            0.79
lv             0.77
Activations Density: 0.087%