INDEX
Explanations
locations or geographic regions
proper nouns, particularly names of locations and people
New Auto-Interp
Negative Logits
dress
-0.86
uit
-0.84
hog
-0.74
bed
-0.74
kick
-0.73
uckland
-0.73
oor
-0.71
ishly
-0.68
draw
-0.68
hart
-0.68
POSITIVE LOGITS
sylvania
0.81
arest
0.80
illin
0.74
Sut
0.72
д
0.72
FUL
0.70
ivable
0.69
onductor
0.69
Takeru
0.65
constitu
0.64
Activations Density 0.064%