INDEX
Explanations
words related to spousal relationships
mentions of the term "husband" in various contexts
Negative Logits
spir       -0.72
JPM        -0.69
uum        -0.67
EVA        -0.66
"]=>       -0.65
McC        -0.65
ortmund    -0.63
cred       -0.63
1889       -0.62
Races      -0.62
Positive Logits
Philip     0.82
pin        0.79
pins       0.75
dad        0.75
hood       0.73
Brandon    0.72
patriarch  0.72
TAIN       0.69
Uriel      0.69
ry         0.69
Activations Density: 0.046%