INDEX
Explanations
numerical rankings and positions
rankings or positions in competitions or events
Negative Logits
brim        -0.68
ROR         -0.60
Rules       -0.58
therape     -0.57
CRE         -0.56
Inquiry     -0.56
olutions    -0.56
Paper       -0.55
Substance   -0.55
Tang        -0.55
Positive Logits
baseman     1.01
arily       0.93
hand        0.87
ority       0.80
onyms       0.79
isl         0.77
ranked      0.77
lane        0.75
oths        0.72
eter        0.72
Activations Density: 0.071%