    Explanations

    past tense verbs

    Negative Logits
    Token            Logit
    " гип"           -0.07
    "rase"           -0.07
    " cả"            -0.07
    " การ"           -0.07
    " překlad"       -0.07
    " limitation"    -0.06
    "anh"            -0.06
    "anes"           -0.06
    (blank)          -0.06
    "walls"          -0.06
    Positive Logits
    Token            Logit
    " Sent"          0.07
    "rej"            0.06
    "(city"          0.06
    " offered"       0.06
    " stored"        0.06
    " Applied"       0.06
    "(len"           0.06
    "_Err"           0.06
    "латы"           0.06
    "_used"          0.06
    Activation Density: 0.142%

    No Known Activations