INDEX
Explanations
numbers with dashes, colons, and units of measurement
numerical values formatted as measurements or dates
Negative Logits
abase        -0.69
Ridley       -0.67
Thumbnails   -0.66
Foss         -0.63
Masquerade   -0.63
Ashes        -0.63
Leia         -0.56
yles         -0.56
vier         -0.55
tert         -0.55
Positive Logits
999          0.82
acan         0.76
9999         0.74
090          0.71
dogs         0.69
576          0.67
unic         0.66
NPR          0.66
295          0.65
monary       0.65
Activations Density 0.127%
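
The density figure can be read as the fraction of token positions on which the feature fires. The sketch below assumes that definition (activation strictly above a threshold of zero); the page itself does not state how the value is computed. The function name `activation_density` and the sample data are hypothetical and chosen only so the result matches the 0.127% shown above.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means share of token positions where the feature
    is active. The dashboard does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 127 of them active.
sample = [0.0] * 100_000
for i in range(127):
    sample[i * 787] = 1.5  # arbitrary positive activation at distinct indices

density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low means the feature is silent on the vast majority of tokens, which fits a feature described as firing on specifically formatted numbers.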