INDEX
Explanations
names of places and entities related to health or governance
New Auto-Interp
Negative Logits
,
-0.91
of
-0.78
in
-0.78
on
-0.73
the
-0.70
.
-0.70
is
-0.69
for
-0.69
with
-0.68
that
-0.68
POSITIVE LOGITS
PhysRev
0.84
Paglinawan
0.84
كومونز
0.81
queſta
0.80
ſelves
0.79
RSITY
0.79
FunctionFlags
0.78
Meksiku
0.78
Савезне
0.77
choreographer
0.76
Activations Density 0.786%