INDEX
Explanations
the word "not" in sentences
New Auto-Interp
Negative Logits
Nice
-0.68
OSP
-0.63
å¥
-0.63
Measures
-0.61
Kings
-0.60
Intern
-0.60
Seasons
-0.59
Nine
-0.59
assessments
-0.59
IDENT
-0.58
POSITIVE LOGITS
necessarily
1.31
tolerate
1.16
icably
1.11
hesitate
1.07
relent
1.01
ogle
1.00
be
0.99
allow
0.97
hin
0.97
bud
0.94
Activations Density 0.071%