INDEX
Explanations
names of specific individuals
proper nouns, particularly names of individuals
New Auto-Interp
Negative Logits
atform
-0.96
sburgh
-0.94
icular
-0.87
ures
-0.87
isphere
-0.87
ional
-0.84
ogene
-0.83
nostic
-0.83
lessly
-0.83
icles
-0.83
POSITIVE LOGITS
Rowe
1.27
Row
0.76
FANTASY
0.75
Christy
0.73
OWN
0.73
idge
0.71
Russo
0.71
Kitty
0.70
dden
0.69
VEL
0.69
Activations Density 0.028%