INDEX
Explanations
instances of authorship in texts
New Auto-Interp
Negative Logits
.wik
-0.14
sumer
-0.14
gregator
-0.14
CEE
-0.14
demokrat
-0.14
Wich
-0.14
CED
-0.13
ayi
-0.13
دÛĮگر
-0.13
صر
-0.13
POSITIVE LOGITS
Feld
0.15
by
0.15
ftime
0.14
istem
0.13
Miles
0.13
Unknown
0.13
uid
0.13
ÐĽÑİ
0.13
Various
0.13
441
0.13
Activations Density 0.035%