INDEX
Explanations
mentions of missing items or errors that need to be corrected
numerical values or amounts in various contexts
Negative Logits
)."     -1.01
.")     -0.73
.'"     -0.72
'."     -0.70
]."     -0.69
.""     -0.67
"></    -0.60
"/>     -0.59
!'"     -0.59
)</     -0.56
Positive Logits
respectively    0.46
DCS             0.44
aido            0.43
oat             0.41
Legion          0.40
crossover       0.40
itled           0.40
CTR             0.39
GOT             0.38
rolet           0.37
Activations Density 2.395%