INDEX
    Explanations

    references to the passage of time, especially in relation to past events

    Negative Logits
    uka           -0.18
    odor          -0.17
    uk            -0.17
    uo            -0.17
    овÑĸ          -0.16
    cken          -0.15
    rab           -0.14
    ouse          -0.14
    rink          -0.14
     '..',        -0.14
    Positive Logits
    edition       0.17
    arp           0.15
    -fashioned    0.14
    -wow          0.14
     Gest         0.14
    -step         0.14
     ÙħÛĮÙĦادÛĮ      0.14
    ittance       0.14
    _compat       0.14
    flash         0.14
    Activation Density: 0.017%

    No Known Activations