INDEX
Explanations
possessive pronouns and references to ownership or belonging
New Auto-Interp
Negative Logits
isms
-0.76
advant
-0.68
versions
-0.68
opted
-0.65
eatures
-0.65
terms
-0.64
operative
-0.63
urations
-0.63
hered
-0.61
abilities
-0.61
POSITIVE LOGITS
undis
0.67
hunger
0.63
unanswered
0.61
NEC
0.61
cms
0.61
RAW
0.61
nerve
0.61
WD
0.61
});
0.60
wic
0.60
Activations Density 0.021%