INDEX
Explanations
mentions of the word "daddy."
references to a specific name or title, particularly in relation to 'addy' or 'Daddy'
New Auto-Interp
Negative Logits
aeper
-0.72
âĸ¬
-0.67
©¶æ
-0.67
ELL
-0.66
ubiqu
-0.65
Rare
-0.64
PRES
-0.64
PER
-0.64
MENT
-0.63
WATCHED
-0.63
POSITIVE LOGITS
addy
1.63
atta
0.82
ickets
0.79
ikk
0.78
wagon
0.77
vier
0.75
daddy
0.72
cakes
0.71
aniel
0.71
Thomson
0.70
Activations Density 0.004%