INDEX
Explanations
phrases related to family connections such as son-in-law, brother, and marriage
phrases indicating familial relationships, specifically related to sons-in-law
New Auto-Interp
Negative Logits
ORTS
-0.66
eatures
-0.64
pressures
-0.63
houn
-0.63
cliffe
-0.62
dq
-0.61
earcher
-0.60
essor
-0.59
WT
-0.59
)].
-0.59
POSITIVE LOGITS
rette
0.63
Jol
0.61
umeric
0.61
Gaul
0.60
wered
0.60
Asc
0.59
Sax
0.59
opter
0.58
ault
0.57
Odin
0.56
Activations Density 0.318%