INDEX
Explanations
references to funding institutions and grant information in research papers
New Auto-Interp
Negative Logits
oarece
-0.70
محفوظة
-0.70
ᅠ
-0.69
rozwo
-0.68
cherchés
-0.68
mieście
-0.65
poziomie
-0.64
helyzet
-0.63
podró
-0.63
kereszt
-0.61
POSITIVE LOGITS
Przypisy
0.81
Marcin
0.81
Marcin
0.80
Pologne
0.79
CommonModule
0.79
Lewandowski
0.76
Polish
0.75
Witcher
0.73
Polish
0.73
Poland
0.73
Activations Density 0.115%