INDEX
    Explanations

    sequences indicating change or movement over time

    Negative Logits
    Token             Logit
    "awi"             -0.16
    "erken"           -0.16
    "366"             -0.16
    "ρυ"              -0.15
    "amas"            -0.14
    "-alist"          -0.14
    "umbed"           -0.14
    "äll"             -0.14
    " Replacement"    -0.14
    "orce"            -0.14
    Positive Logits
    Token             Logit
    " again"          0.26
    "again"           0.25
    " Again"          0.22
    "Again"           0.22
    " lại"            0.21
    " novamente"      0.20
    " 다시"           0.19
    " снова"          0.19
    " wieder"         0.19
    " weer"           0.18
    Act Density 0.139%

    No Known Activations