INDEX
    Explanations

    phrases indicating time and past events or changes

    Negative Logits
     soon
    -0.29
    soon
    -0.25
     currently
    -0.23
     finally
    -0.21
     now
    -0.21
    currently
    -0.20
     recently
    -0.20
    目前
    -0.20
    finally
    -0.19
     ultimately
    -0.18
    Positive Logits
     merely
    0.24
     simply
    0.22
     solely
    0.20
     only
    0.19
    (before
    0.18
     thought
    0.18
     simplement
    0.17
     بوده
    0.17
     просто
    0.17
     iken
    0.16
    Act Density 0.229%

    No Known Activations