INDEX
Explanations
terms that indicate temporal sequence or transitions in events
the word "Before" at the beginning of sentences or clauses.
Negative Logits
dụ        -0.36
DUC       -0.31
houden    -0.30
voeren    -0.29
mitos     -0.29
laag      -0.29
profonda  -0.29
shtml     -0.28
eaked     -0.28
ㅜ        -0.28
Positive Logits
hand      0.83
before    0.74
Before    0.71
andafter  0.70
Before    0.69
BEFORE    0.69
before    0.67
BEFORE    0.66
HAND      0.64
Мексичка  0.64
Activations Density: 0.118%