INDEX
Explanations
proper nouns, specifically names of people and places
the names of individuals and specific geographical locations
New Auto-Interp
Negative Logits
amina
-0.76
iversary
-0.75
peed
-0.72
taker
-0.68
chers
-0.68
terson
-0.67
lis
-0.66
onomy
-0.66
âķIJâķIJ
-0.64
pee
-0.64
POSITIVE LOGITS
vernment
0.88
Gloria
0.84
GSL
0.75
Gw
0.73
estation
0.68
ilt
0.65
ILLE
0.64
HF
0.64
Lena
0.64
ossip
0.64
Activations Density 0.013%