INDEX
Explanations
mentions of familial relationships, particularly the word "Dad"
mentions of "Dad" or variations of the term
New Auto-Interp
Negative Logits
atility
-0.77
CONT
-0.75
Flavoring
-0.73
lihood
-0.71
Ĥ¬
-0.68
veyard
-0.67
76561
-0.67
rawdownloadcloneembedreportprint
-0.67
Nost
-0.66
ichen
-0.63
POSITIVE LOGITS
liest
0.95
Dad
0.95
iji
0.87
father
0.87
hesis
0.87
dad
0.82
patriarch
0.81
daddy
0.81
Dad
0.80
ma
0.80
Activations Density 0.010%