INDEX
Explanations
mentions of people's names, particularly the name "Harrison"
proper nouns, particularly names associated with individuals
NEGATIVE LOGITS
ggles      -0.84
gments     -0.73
ailand     -0.71
anooga     -0.70
terday     -0.69
uration    -0.68
hement     -0.67
ãĥ³        -0.66
reme       -0.65
osuke      -0.64
POSITIVE LOGITS
Crosby       0.96
Vaugh        0.86
reath        0.76
Ambro        0.76
gren         0.75
Rockefeller  0.73
Goodman      0.72
HAEL         0.70
uddy         0.69
alf          0.68
Activations Density 0.099%