INDEX
Explanations
proper nouns with titles or names
instances of ending punctuation
New Auto-Interp
Negative Logits
etheless
-0.75
wcs
-0.72
advis
-0.70
secondly
-0.70
translation
-0.70
charact
-0.68
ãĥĦ
-0.64
ignition
-0.63
ãĤ¨ãĥ«
-0.63
arrang
-0.63
POSITIVE LOGITS
Smith
1.03
Olson
1.00
Miller
1.00
Baker
1.00
Bernstein
0.99
Stephens
0.98
Gors
0.98
Ware
0.97
Decker
0.96
Peterson
0.96
Activations Density 0.034%