INDEX
Explanations
of study, impact, competition, government, people
New Auto-Interp
Negative Logits
a
0.74
the
0.69
an
0.63
стых
0.61
jších
0.61
вый
0.60
the
0.59
とに
0.59
petróleo
0.59
aların
0.58
POSITIVE LOGITS
organization
0.74
endeavor
0.68
hots
0.62
사용하여
0.60
увагу
0.60
emphasized
0.59
ГӀ
0.59
νο
0.59
enthusiast
0.59
ization
0.58
Activations Density 0.054%