INDEX
Explanations
phrases related to negation or exclusion
conjunctions and prepositions indicating alternative options or conditions
Negative Logits
horm            -0.72
tackle          -0.68
Conan           -0.66
ETS             -0.64
ires            -0.64
efer            -0.63
nee             -0.60
onday           -0.60
DragonMagazine  -0.60
Reson           -0.59
Positive Logits
chard           1.17
Else            1.10
acles           1.02
anything        1.02
acle            1.01
else            0.98
chid            0.96
nam             0.95
otherwise       0.91
ifice           0.87
Activations Density 0.125%