Explanations
the use of the pronoun "I"
Negative Logits
Reverse              -0.66
tnc                  -0.66
pires                -0.62
groupon              -0.59
Gap                  -0.59
ses                  -0.59
INGTON               -0.57
indistinguishable    -0.56
excess               -0.56
interstitial         -0.56
Positive Logits
'm                    1.40
've                   1.30
'll                   1.17
'd                    1.13
suppose               1.11
dunno                 1.10
ronic                 1.05
guess                 1.03
mean                  1.00
hope                  1.00
Activations Density: 0.160%