Explanations
names of individuals
proper nouns, specifically names of individuals and entities
Negative Logits
Token        Logit
ilk          -0.68
opal         -0.67
undo         -0.66
bloodstream  -0.66
fitting      -0.64
hemisphere   -0.63
plement      -0.62
ctica        -0.62
izoph        -0.61
igmat        -0.60
Positive Logits
Token        Logit
illard       1.29
rand         0.78
ville        0.75
Dunn         0.73
stall        0.73
Lowe         0.71
yard         0.71
erness       0.70
ieri         0.69
stown        0.68
Activations Density 0.008%