INDEX
Explanations
time periods and their duration references
New Auto-Interp
Negative Logits
lor
-0.17
subsequent
-0.16
orate
-0.15
à¤Ĥà¤ľ
-0.15
ÛĮزÛĮ
-0.15
reak
-0.14
.Tool
-0.14
/back
-0.13
aris
-0.13
bah
-0.13
POSITIVE LOGITS
later
0.40
Later
0.30
Later
0.30
later
0.29
earlier
0.28
Earlier
0.25
später
0.24
Earlier
0.23
alter
0.20
latter
0.20
Activations Density 0.028%