INDEX
Explanations
references to rankings or positions in a list
references to various rankings and their associated values
New Auto-Interp
Negative Logits
vous
-0.94
gm
-0.81
cise
-0.76
romy
-0.74
ña
-0.73
rote
-0.71
resp
-0.70
veh
-0.70
Ple
-0.68
pron
-0.67
POSITIVE LOGITS
rankings
1.44
Rankings
1.21
ranking
1.06
standings
0.94
Rank
0.92
uggest
0.87
elist
0.84
ratings
0.83
contenders
0.83
listings
0.83
Activations Density 0.007%