INDEX
Explanations
contractions indicating negation or questioning in written text
negative contractions in questions and statements
New Auto-Interp
Negative Logits
Reviewer
-0.78
cele
-0.69
æ©
-0.63
itone
-0.63
Ñģ
-0.63
places
-0.59
Cosponsors
-0.59
forms
-0.59
Tracker
-0.58
inen
-0.58
POSITIVE LOGITS
bother
0.67
berra
0.66
ync
0.63
gonna
0.62
ffff
0.62
igslist
0.61
estine
0.60
icable
0.60
necessarily
0.60
attering
0.59
Activations Density 0.071%