INDEX
Explanations
names of people
proper nouns and capitalized identifiers
Negative Logits
referen     -0.81
contempor   -0.77
metic       -0.75
bullet      -0.75
cow         -0.74
bean        -0.73
confer      -0.73
delegate    -0.71
hemor       -0.70
finger      -0.69
Positive Logits
ses         1.13
asses       1.12
les         1.12
ps          1.12
bles        1.11
als         1.11
ents        1.10
zes         1.09
ds          1.07
ues         1.07
Activations Density 0.154%