INDEX
Explanations
names or abbreviation initials followed by a particular letter grade
proper nouns, specifically names and titles
Negative Logits
Token          Logit
umbn           -0.69
saf            -0.68
éĹĺ            -0.66
ModLoader      -0.65
oided          -0.64
caps           -0.63
thirds         -0.63
Fair           -0.62
availability   -0.60
Reviewer       -0.59
Positive Logits
Token          Logit
.?             0.89
.,             0.82
.:             0.73
./             0.72
.;             0.71
ullivan        0.67
Armour         0.64
.,"            0.63
uce            0.63
#$             0.63
Activations Density 0.064%