INDEX
Explanations
numerical references and citations in academic texts
NEGATIVE LOGITS
arkan    -0.15
ød       -0.15
umu      -0.15
strup    -0.15
або      -0.15
Virt     -0.15
patch    -0.14
DST      -0.14
ابÛĮ     -0.14
ее       -0.14
POSITIVE LOGITS
Mick      0.15
fuse      0.15
ahun      0.14
Vacuum    0.14
ago       0.14
obl       0.14
Thatcher  0.14
daily     0.14
vacuum    0.13
iden      0.13
Activations Density 0.030%