Explanations
mentions of family members, especially mothers and parents
Negative Logits
Token           Logit
Datuak          -0.40
Ze              -0.38
gdx             -0.36
veremos         -0.36
hand            -0.35
Misc            -0.35
съ              -0.35
BorderLayout    -0.35
fla             -0.35
mis             -0.35
Positive Logits
Token           Logit
father          0.66
grandfather     0.58
Father          0.58
👵              0.58
grandparents    0.58
Grandfather     0.57
FATHER          0.57
abuelo          0.56
grandmother     0.55
abuela          0.54
Activations Density: 0.044%