INDEX
Explanations
names of individuals
proper nouns related to individuals, particularly names
New Auto-Interp
Negative Logits
arist
-0.68
bg
-0.67
quer
-0.66
chal
-0.63
ggle
-0.63
20439
-0.63
dear
-0.61
eals
-0.61
semble
-0.61
UAL
-0.60
POSITIVE LOGITS
Fowler
1.33
herty
0.81
ocity
0.81
Osw
0.80
reys
0.79
shire
0.78
asma
0.76
enhagen
0.75
owler
0.74
GF
0.72
Activations Density 0.007%