INDEX
Explanations
numeric and reference-related information in academic citations
Negative Logits
ateg      -0.18
inar      -0.17
aub       -0.16
covering  -0.16
èĩ¨       -0.15
edom      -0.14
erable    -0.14
važ       -0.14
covering  -0.14
ạ         -0.14
Positive Logits
idos      0.15
untas     0.15
amat      0.15
INET      0.14
umat      0.14
γοÏį      0.14
εν        0.14
Lup       0.14
211       0.13
ocks      0.13
Activations Density: 0.034%