INDEX
Explanations
phrases indicating a sequence of events
actions and events that are revealed or confirmed later in time
New Auto-Interp
Negative Logits
yesterday
-0.83
tomorrow
-0.76
lately
-0.73
today
-0.70
currently
-0.69
tonight
-0.67
formerly
-0.66
Thurs
-0.66
constantly
-0.63
yet
-0.63
POSITIVE LOGITS
regretted
0.77
relent
0.68
retract
0.67
iosyncr
0.66
acknow
0.65
regret
0.63
forg
0.62
Reviewer
0.61
apologise
0.61
gur
0.59
Activations Density 0.284%