INDEX
Explanations
proper nouns, specifically names of people
mentions of the name "Rebecca" and its variations
New Auto-Interp
Negative Logits
teenth
-0.76
actic
-0.74
awaru
-0.72
tesque
-0.67
igree
-0.67
ailand
-0.66
arnaev
-0.66
dracon
-0.65
ramid
-0.65
ibaba
-0.65
POSITIVE LOGITS
Rebecca
0.98
McKenzie
0.90
Lopez
0.84
Koen
0.84
Upton
0.83
Bella
0.82
Chal
0.81
Mellon
0.81
issance
0.80
Lazarus
0.79
Activations Density 0.012%