INDEX
Explanations
proper nouns related to people's names
the mention of individuals, particularly their names
New Auto-Interp
Negative Logits
stood
-0.74
oan
-0.72
oras
-0.70
asel
-0.68
haps
-0.68
ãĥĢ
-0.68
acerb
-0.68
ãĥł
-0.67
onym
-0.66
onyms
-0.64
POSITIVE LOGITS
Webb
1.15
swick
0.88
sonian
0.83
Dixon
0.81
Everett
0.78
Telescope
0.75
icz
0.74
Hutch
0.74
Corker
0.73
glass
0.73
Activations Density 0.007%