INDEX
Explanations
mentions of grandfathers
references to familial relationships, specifically focusing on grandfathers and grandmothers
Negative Logits
IL         -0.68
scape      -0.67
cock       -0.66
waves      -0.66
Parsons    -0.65
eth        -0.63
Scene      -0.62
ily        -0.62
Slay       -0.62
CP         -0.61
Positive Logits
grandfather      3.24
grandparents     3.16
grandmother      3.15
grandma          2.54
grandchildren    1.97
grandson         1.94
aunt             1.93
granddaughter    1.92
uncle            1.70
grand            1.61
Activations Density 0.021%
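
The density figure can be read as the fraction of tokens on which the feature fires at all. The sketch below is a minimal illustration under that assumption; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a non-zero
    (above-threshold) activation; the page itself does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, 2 of which activate the feature.
sample = [0.0] * 9998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.020%"
```

Under this reading, a density of 0.021% would correspond to roughly 2 active tokens in every 10,000, which fits a feature tied to a narrow set of kinship words.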