INDEX
Explanations
references to scientific journals and publications
New Auto-Interp
Negative Logits
Alb
-0.16
gest
-0.15
Meh
-0.15
ante
-0.15
olik
-0.15
adar
-0.14
ocop
-0.14
atk
-0.14
sulf
-0.14
anners
-0.14
POSITIVE LOGITS
Nature
0.26
Nature
0.25
npj
0.25
nature
0.22
nature
0.20
Pointer
0.16
_EXTENDED
0.16
dataArray
0.16
ATURE
0.15
Nat
0.15
Activations Density 0.030%