INDEX
Explanations
indicative verbs and statements about future events and plans
Negative Logits
por       -0.43
dan       -0.42
ровании   -0.41
sering    -0.40
fony      -0.40
en        -0.39
Success   -0.39
ITUDE     -0.38
soal      -0.38
)|^{      -0.38
Positive Logits
now          1.33
теперь       1.18
henceforth   1.15
Теперь       1.15
Теперь       1.12
now          1.05
désormais    1.04
doré         1.02
Now          0.99
这下         0.98
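The page does not say how the logit lists above are produced. A minimal sketch, assuming they come from projecting the feature's output direction onto each token's unembedding vector and then ranking the tokens by the resulting score (a common convention for dashboards of this kind). The function name, the vocabulary, and all numbers below are hypothetical toy values, not data from this feature.

```python
def top_logit_tokens(direction, unembed, vocab, k=2):
    """Rank tokens by the dot product of a feature direction with each
    token's unembedding vector (hypothetical helper, toy illustration)."""
    effects = [
        sum(d * w for d, w in zip(direction, token_vec))
        for token_vec in unembed
    ]
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]          # most suppressed tokens, lowest first
    positive = ranked[::-1][:k]    # most promoted tokens, highest first
    return negative, positive


# Toy inputs (assumed): a 2-dimensional direction and 4 made-up tokens.
vocab = ["a", "b", "c", "d"]
direction = [1.0, 0.0]
unembed = [[2.0, 0.0], [-1.0, 0.0], [0.5, 0.0], [-3.0, 0.0]]

negative, positive = top_logit_tokens(direction, unembed, vocab)
print("negative:", negative)
print("positive:", positive)
```

Under this assumption, a token such as "now" appearing at the top of the positive list would mean the feature pushes the model's output distribution toward that token when it is active.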
Activations Density 0.226%
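The page does not define the density figure. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero, expressed as a percentage. The function name, the threshold parameter, and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`
    (assumed definition; hypothetical helper)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample (assumed): 1000 tokens, the feature fires on 2 of them.
sample = [0.0] * 998 + [1.5, 0.3]
density = activation_density(sample)
print(f"{density:.3%}")
```

On this reading, a density of 0.226% would mean the feature is active on roughly 2 out of every 1000 tokens, which is typical of a sparse feature.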