INDEX
Explanations
mentions of a media outlet named "Mother."
New Auto-Interp
Negative Logits
ript
-0.75
ramer
-0.71
aping
-0.68
NRS
-0.66
ratulations
-0.64
quer
-0.63
jriwal
-0.61
wheelchair
-0.61
chnology
-0.59
raper
-0.59
POSITIVE LOGITS
Teresa
0.96
ship
0.95
hood
0.88
fuck
0.85
BRE
0.81
baugh
0.81
hesis
0.79
ships
0.79
Son
0.77
heses
0.77
Activations Density 0.026%