INDEX
Explanations
proper nouns related to specific geographical locations, particularly the Bronx
references to locations, particularly focusing on the Bronx and Harlem
New Auto-Interp
Negative Logits
OTE
-0.82
orthy
-0.81
ontent
-0.81
aking
-0.76
AKING
-0.76
akable
-0.76
lust
-0.76
soType
-0.75
neau
-0.74
OTT
-0.73
POSITIVE LOGITS
xual
1.24
Zoo
0.99
erick
0.80
cair
0.78
Bronx
0.78
Clown
0.70
plex
0.69
boiler
0.67
eno
0.66
Doodle
0.65
Activations Density 0.039%