INDEX
Explanations
names of specific individuals
names of specific individuals, particularly those with the last names Weaver and Weld
Negative Logits
Token        Logit
ocaust       -0.84
terior       -0.76
ournal       -0.72
grading      -0.72
eat          -0.71
words        -0.70
ibli         -0.70
phant        -0.69
duct         -0.68
etermined    -0.68
Positive Logits
Token        Logit
Weaver       1.36
lings        0.77
bats         0.73
Herz         0.71
quist        0.70
Gunn         0.69
Henderson    0.69
Kenobi       0.69
Hawkins      0.66
Witch        0.65
Activations Density: 0.008%