INDEX
Explanations
names of researchers and their associated works
New Auto-Interp
Negative Logits
kasarigan
-0.82
للمعارف
-0.82
estekak
-0.80
queſta
-0.72
tvguidetime
-0.69
Geſch
-0.61
EconPapers
-0.61
&___
-0.60
ſind
-0.59
utafitiHapana
-0.57
POSITIVE LOGITS
,
0.56
and
0.39
https
0.36
0
0.34
le
0.33
2
0.33
1
0.31
INSTANCE
0.30
estancias
0.30
Fürst
0.30
Activations Density 0.393%