INDEX
Explanations
occurrences of citations and references to studies or external sources
New Auto-Interp
Negative Logits
duit
-0.16
enk
-0.15
dit
-0.15
avad
-0.14
Dit
-0.14
æº
-0.14
atin
-0.14
cession
-0.14
hin
-0.14
eder
-0.13
POSITIVE LOGITS
ncpy
0.17
activex
0.17
egt
0.15
prites
0.15
anlı
0.14
ienza
0.14
ãĥĭãĤ¢
0.14
çŃĴ
0.14
abr
0.14
Kỳ
0.14
Activations Density 0.055%