Explanations
citations and bibliographic references
Negative Logits
Token          Logit
Table          -0.15
andi           -0.15
Right          -0.15
panse          -0.14
bach           -0.14
Entry          -0.14
List           -0.14
Directors      -0.14
/Table         -0.14
usercontent    -0.14
Positive Logits
Token          Logit
doi            0.28
note           0.24
journal        0.23
doi            0.23
.publisher     0.22
publisher      0.22
note           0.22
keywords       0.21
publisher      0.21
journal        0.21
Activations Density: 0.005%