Explanations
references to personal or possessive pronouns, especially in relation to individuals
Negative Logits
цездатний           -0.62
UnsafeEnabled       -0.61
ihnach              -0.59
nakalista           -0.58
évaluateur          -0.56
wireType            -0.55
ագրություններ       -0.54
SharedDtor          -0.54
ambién              -0.52
principalColumn     -0.52
Positive Logits
herself             0.57
her                 0.48
सकती                0.46
she                 0.46
herself             0.42
lass                0.40
但她                0.40
她                  0.40
She                 0.40
heroine             0.38
Activations Density 0.519%