Explanations
references to a term involving family relations such as "Uncle."
references to "Uncle" in various contexts
Negative Logits
Token        Logit
icum         -0.84
istas        -0.81
mberg        -0.72
*/(          -0.70
ties         -0.69
emic         -0.66
ista         -0.65
Attribution  -0.65
ership       -0.64
izes         -0.63
Positive Logits
Token        Logit
veland       1.02
Uncle        0.99
uncle        0.85
heses        0.83
aned         0.81
Daddy        0.81
rex          0.79
princip      0.78
hetic        0.76
arest        0.75
Activations Density: 0.014%