INDEX
Explanations
references to personal health and medical conditions
Followed by female pronouns
female pronouns she her
New Auto-Interp
Negative Logits
яке
-0.78
ньому
-0.74
quels
-0.69
ambao
-0.69
Оно
-0.68
которое
-0.67
dets
-0.65
οποίο
-0.63
Akismet
-0.63
cherchés
-0.63
POSITIVE LOGITS
her
4.25
she
4.15
herself
3.43
she
3.00
그녀
2.74
hers
2.69
她
2.68
彼女の
2.68
She
2.63
herself
2.63
Activations Density 2.704%