INDEX
Explanations
references and citations in scientific publications
New Auto-Interp
Negative Logits
furn
-0.61
asionally
-0.61
ndra
-0.60
urally
-0.59
ween
-0.59
taxp
-0.58
reon
-0.58
orate
-0.56
Shay
-0.56
ez
-0.55
POSITIVE LOGITS
1016
0.96
978
0.83
1007
0.78
1027
0.73
0004
0.71
ãĥł
0.69
112
0.69
1111
0.66
Journals
0.65
978
0.65
Activations Density 5.343%