INDEX
Explanations
comparisons involving numerical data and performance metrics
Negative Logits
higher          -0.16
stry            -0.15
higher          -0.15
omu             -0.15
,...↵↵          -0.15
Higher          -0.15
,application    -0.14
žit             -0.14
íģ              -0.14
suspend         -0.14
Positive Logits
only            0.30
smaller         0.26
mere            0.26
fewer           0.25
less            0.24
merely          0.24
apenas          0.23
Only            0.22
only            0.22
Only            0.21
Activations Density 0.109%