INDEX
Explanations
annotations or references in text, particularly those starting with '@'
New Auto-Interp
Negative Logits
iske
-0.18
akra
-0.16
pter
-0.15
icing
-0.14
unami
-0.14
otti
-0.14
.Actions
-0.14
Manager
-0.13
thon
-0.13
Manager
-0.13
POSITIVE LOGITS
article
0.30
article
0.29
Article
0.26
misc
0.25
misc
0.25
.article
0.25
Article
0.24
_article
0.24
-article
0.22
Misc
0.20
Activations Density 0.004%