INDEX
Explanations
mentions of family relationships, specifically grandparents and grandchildren
references to "grand" or "great-grand" relationships or terms
Negative Logits
Downloadha   -1.01
ijk          -0.67
Alvarez      -0.64
tics         -0.63
livest       -0.60
qqa          -0.59
argon        -0.59
Dialogue     -0.59
uality       -0.58
Altern       -0.57
Positive Logits
father       1.20
mother       1.13
grand        1.02
parents      1.01
Prix         0.98
daughter     0.97
iosity       0.96
pa           0.95
parent       0.89
child        0.89
Activations Density: 0.006%