INDEX
Explanations
references to time periods and durations
New Auto-Interp
Negative Logits
erez
-0.17
æĸ¼
-0.14
unt
-0.14
upon
-0.14
ään
-0.14
awah
-0.14
apon
-0.13
unu
-0.13
836
-0.13
ilin
-0.13
POSITIVE LOGITS
immediately
0.33
leading
0.31
directly
0.29
immedi
0.27
following
0.25
proceeding
0.25
immediate
0.24
Leading
0.24
leading
0.23
Leading
0.23
Activations Density 0.063%