INDEX
Explanations
texts with citation formats
numerical references and publication details in academic contexts
New Auto-Interp
Negative Logits
wed
-0.81
unlucky
-0.78
NXT
-0.75
lucky
-0.74
riding
-0.72
decor
-0.71
rooting
-0.71
inexper
-0.71
manship
-0.70
friendly
-0.70
POSITIVE LOGITS
âĵĺ
1.39
Ibid
1.33
http
1.25
Abstract
1.11
Reviewer
1.09
References
1.08
doi
1.07
Retrieved
1.04
doi
1.01
https
1.01
Activations Density 0.214%