INDEX
Explanations
mentions of the word "Mother"
references to "Mother" in various contexts
New Auto-Interp
Negative Logits
NRS
-0.76
ript
-0.74
EY
-0.73
ickr
-0.70
ramer
-0.70
aping
-0.70
RAFT
-0.67
okers
-0.66
ype
-0.65
REP
-0.64
POSITIVE LOGITS
Mother
0.97
Teresa
0.94
ship
0.85
Mother
0.82
hood
0.80
Joy
0.75
hesis
0.74
Anne
0.73
Mary
0.73
heses
0.73
Activations Density 0.018%