INDEX
Explanations
phrases indicating time and past events or changes
New Auto-Interp
Negative Logits
soon
-0.29
soon
-0.25
currently
-0.23
finally
-0.21
now
-0.21
currently
-0.20
recently
-0.20
缮åīį
-0.20
finally
-0.19
ultimately
-0.18
POSITIVE LOGITS
merely
0.24
simply
0.22
solely
0.20
only
0.19
(before
0.18
thought
0.18
simplement
0.17
بÙĪØ¯Ùĩ
0.17
пÑĢоÑģÑĤо
0.17
iken
0.16
Activations Density 0.229%