INDEX
    Explanations

    references to time, specifically durations and time-related phrases

    Negative Logits
    Token         Logit
    "IfExists"    -0.18
    "iolet"       -0.17
    "aison"       -0.16
    " å¤"         -0.15
    " Barr"       -0.15
    "phem"        -0.15
    "夕"          -0.14
    "EXIT"        -0.14
    ".nih"        -0.14
    "IDL"         -0.14
    Positive Logits
    Token         Logit
    " ago"        0.32
    "ago"         0.31
    " Ago"        0.28
    "AGO"         0.23
    " önce"       0.22
    " назад"      0.18
    "前"          0.18
    "ampo"        0.17
    "isters"      0.16
    " 前"         0.15
    Act Density 0.009%

    No Known Activations