INDEX
Explanations
key indicators or metrics related to categorization or evaluation criteria
New Auto-Interp
Negative Logits
ernen
-0.15
atorio
-0.15
erset
-0.15
ycl
-0.15
Dodd
-0.15
ystone
-0.14
edly
-0.13
areth
-0.13
allon
-0.13
406
-0.13
POSITIVE LOGITS
chine
0.15
инов
0.15
ichel
0.15
ulu
0.15
bcm
0.15
iaux
0.14
šit
0.14
BT
0.14
elu
0.14
ecl
0.14
Activations Density 0.062%