Explanations
references to current activities or conditions related to people or organizations
Negative Logits
previously    -1.14
then          -1.06
initially     -1.05
originally    -1.03
ранее         -1.02
sebelumnya    -0.98
previous      -0.98
kiedyś        -0.97
anfangs       -0.97
zuvor         -0.95
Positive Logits
weer          0.64
看来          0.57
ш             0.57
看來          0.54
来看          0.52
seks          0.52
为止          0.52
WEBPACK       0.52
шня           0.52
unrecogn      0.52
Activations Density: 0.377%