Explanations
references to specific rankings or awards associated with the letter "D"
Negative Logits
iram        -0.19
jvu         -0.19
raw         -0.18
esc         -0.17
rop         -0.17
ocs         -0.17
ropp        -0.17
ictionary   -0.16
ays         -0.16
anny        -0.16
Positive Logits
apo         0.16
coded       0.15
ael         0.15
Roose       0.15
prox        0.15
anela       0.15
-REAL       0.15
Ñħ          0.15
lad         0.14
lh          0.14
Activations Density: 0.035%