INDEX
Explanations
names of individuals, possibly focusing on surnames
proper nouns, particularly names and titles
Negative Logits
rish      -0.77
orget     -0.70
pmwiki    -0.69
bearings  -0.66
bilt      -0.66
ilde      -0.65
hattan    -0.63
Medal     -0.62
sburgh    -0.61
puff      -0.61
Positive Logits
manship     0.81
hao         0.77
ciples      0.75
conn        0.73
ahime       0.71
terior      0.70
Introduced  0.70
RECT        0.69
WINDOWS     0.67
mustard     0.65
Activations Density 0.082%
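
As a rough illustration of what a density figure like the one above measures, the sketch below assumes density means the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the toy data are all assumptions for illustration; the page itself does not state how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumed definition for illustration only; the source page does not
    say how its density value was obtained.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 8 of them active.
toy = [0.0] * 9992 + [1.5] * 8
print(f"{activation_density(toy):.3%}")  # prints 0.080%
```

Under this reading, a density of 0.082% would correspond to the feature firing on roughly 8 of every 10,000 tokens.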