INDEX
Explanations
pronouns, particularly those referring to individuals or groups
Third-person pronouns
pronouns followed by verbs
New Auto-Interp
Negative Logits
pinulongan
-0.63
pomo
-0.53
taken
-0.45
est
-0.45
pras
-0.45
ordering
-0.44
реш
-0.44
anc
-0.43
Verde
-0.42
řeb
-0.42
POSITIVE LOGITS
can
0.98
never
0.96
had
0.95
would
0.94
have
0.91
always
0.90
didn
0.89
still
0.89
'):
0.88
could
0.86
Activations Density 0.259%