Explanations
mentions of the word "Mother" with varying intensities
references to the word "Mother" and related phrases indicating maternal themes
Negative Logits
ript      -0.83
ramer     -0.78
aping     -0.76
ahime     -0.71
orman     -0.71
neum      -0.69
ators     -0.66
jriwal    -0.65
aped      -0.65
ickr      -0.64
Positive Logits
Teresa    0.91
Mother    0.88
BRE       0.81
Father    0.79
Son       0.79
hood      0.79
Mother    0.78
fuck      0.78
Daughter  0.77
Earth     0.74
Activations Density 0.049%
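
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number is commonly computed, assuming it means the fraction of token positions on which the feature fires. The function name `activation_density`, the `threshold` parameter, and the sample activations are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature is active.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which activate the feature.
acts = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.050%
```

Under this reading, a density of 0.049% would mean the feature is active on roughly 1 in 2,000 tokens.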