INDEX
Explanations
proper nouns
specific non-English characters or symbols, particularly relating to names
New Auto-Interp
Negative Logits
naires
-0.70
ibly
-0.67
Chavez
-0.65
IBLE
-0.64
ibility
-0.64
wisdom
-0.64
Cobra
-0.62
sorts
-0.62
helicop
-0.61
Crus
-0.60
POSITIVE LOGITS
ö
1.25
zbek
1.24
ller
1.14
lde
1.11
hler
1.08
hl
1.07
sten
1.05
ppel
1.04
ÃŁ
1.00
misc
1.00
Activations Density 0.020%