INDEX
    Explanations

    instances of time indicators or temporal markers

    New Auto-Interp
    Negative Logits
     ring
    -0.17
     h
    -0.16
    Äħż
    -0.15
    agina
    -0.14
     punch
    -0.14
     English
    -0.14
    omencl
    -0.14
    ring
    -0.14
     mosquito
    -0.14
    XB
    -0.14
    POSITIVE LOGITS
    ehir
    0.18
    ÏĨη
    0.15
    \App
    0.15
     yên
    0.15
    zych
    0.14
    nga
    0.14
    ccione
    0.14
    ãĥ¼ãĥ©
    0.14
     κοι
    0.14
    hazi
    0.13
    Act Density 0.001%

    No Known Activations