INDEX
    Explanations

    past-tense verbs that indicate actions or events

    New Auto-Interp
    Negative Logits
    á»ħ
    -0.15
    icut
    -0.15
     unzip
    -0.15
    ÃŃc
    -0.14
    resse
    -0.14
    gL
    -0.14
     LENG
    -0.14
    apol
    -0.13
     Roberto
    -0.13
    dress
    -0.13
    POSITIVE LOGITS
    aly
    0.15
    egal
    0.15
    iá»ĥn
    0.15
    chas
    0.15
    chan
    0.14
    enty
    0.14
    /goto
    0.14
    åł
    0.14
    chg
    0.14
    hlen
    0.14
    Act Density 0.055%

    No Known Activations