Explanations
phrases that indicate the necessity or importance of actions or understanding
Negative Logits
Token            Logit
autorytatywna    -1.22
nawr             -0.98
referenties      -0.96
ConstraintMaker  -0.93
tvguidetime      -0.90
ValueStyle       -0.83
Biôgrafia        -0.83
AssemblyCulture  -0.82
Renoir           -0.81
webdriver        -0.80
Positive Logits
Token    Logit
people   0.68
bagi     0.66
accro    0.59
the      0.58
there    0.56
a        0.52
mennes   0.52
to       0.52
we       0.50
someone  0.49
Activations Density: 0.304%