Explanations
phrases related to parenting and family responsibilities
Negative Logits
niece            -0.22
nephew           -0.22
granddaughter    -0.20
grandson         -0.19
ứ                -0.17
ñas              -0.14
WithIdentifier   -0.14
ubat             -0.14
orz              -0.14
Puppy            -0.13
Positive Logits
parents    0.94
parent     0.88
Parents    0.78
parents    0.75
parent     0.72
Parent     0.71
Parents    0.70
-parent    0.69
(parent    0.66
Parent     0.66
Activations Density: 0.459%