INDEX
Explanations
the term "wife" and its variations, indicating a focus on familial relationships
New Auto-Interp
Negative Logits
inalg
-0.74
مو
-0.72
Kenn
-0.71
appart
-0.69
Mov
-0.67
bước
-0.67
Kenn
-0.67
llon
-0.65
ttal
-0.65
hoàn
-0.64
POSITIVE LOGITS
wives
1.28
wife
1.27
wife
1.21
WIFE
1.19
wives
1.14
Wife
1.10
Wives
1.03
Wife
0.99
smtplib
0.90
y
0.87
Activations Density 0.152%