INDEX
Explanations
references to family
references to family-related concepts
Negative Logits
*/(          -0.91
psey         -0.88
tical        -0.81
ane          -0.72
umbn         -0.70
lite         -0.69
Archdemon    -0.67
oppy         -0.67
jriwal       -0.67
igo          -0.67
Positive Logits
resemb       0.96
ILY          0.94
members      0.87
patriarch    0.87
members      0.85
caregivers   0.80
resemblance  0.78
hood         0.78
ilial        0.77
hesis        0.77
Activations Density: 0.037%