INDEX
Explanations
proper nouns, specifically names like "Shawn"
occurrences of the name "Shawn."
Negative Logits
keley       -0.92
scill       -0.82
gered       -0.75
vernment    -0.72
vernight    -0.72
ught        -0.69
女          -0.67
ña          -0.66
代          -0.65
nces        -0.65
Positive Logits
Michaels    1.12
Kemp        0.94
sburg       0.90
ees         0.90
ee          0.83
ive         0.81
Shawn       0.79
Summers     0.75
Dillon      0.74
Thornton    0.74
Activations Density 0.024%
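
The page does not say how the density figure is computed. One common reading, assumed here, is the fraction of sampled token positions on which the feature activates above zero. The sketch below illustrates that reading with made-up activations; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    The dashboard itself does not confirm this definition.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data, invented for illustration: 3 firing positions out of 12,500.
toy_activations = [0.0] * 12_497 + [1.3, 0.4, 2.1]
density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints "0.024%"
```

Under this reading, a density of 0.024% means the feature fires on roughly 1 token in every 4,000, which is consistent with a feature tied to a specific name.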