INDEX
Explanations
references to scholarly publications and their details
New Auto-Interp
Negative Logits
zell
-0.17
udeau
-0.15
eria
-0.15
aine
-0.15
hausen
-0.14
hiro
-0.14
Pant
-0.14
ipeg
-0.14
_DLL
-0.14
nowhere
-0.14
POSITIVE LOGITS
Else
0.38
Else
0.37
Wiley
0.37
Rout
0.33
Springer
0.33
ELSE
0.31
Taylor
0.29
Taylor
0.28
ELSE
0.28
else
0.27
Activations Density 0.128%