INDEX
Explanations
mentions of the name "Wendy"
references to specific individuals, particularly those named Wendy or Julie
Negative Logits
atory        -0.88
unin         -0.78
atar         -0.75
alion        -0.75
Ëľ           -0.74
chall        -0.73
nown         -0.72
tarian       -0.71
istically    -0.71
ogn          -0.70
Positive Logits
Wendy        0.98
Bee          0.80
Blossom      0.74
Rice         0.74
Jen          0.71
Chun         0.70
Vanessa      0.67
sauces       0.66
issance      0.65
ank          0.64
Activations Density: 0.026%