INDEX
    Explanations

    important verbs and phrases indicating actions or requests

    New Auto-Interp
    Negative Logits
    è£ķ
    -0.15
    tright
    -0.15
    roken
    -0.15
    ksen
    -0.15
    ضة
    -0.15
    923
    -0.15
    avour
    -0.14
    kol
    -0.14
    sole
    -0.14
     Pru
    -0.14
    POSITIVE LOGITS
    uib
    0.15
     Bark
    0.15
    ugo
    0.15
    æĬ¼
    0.14
    isy
    0.14
    одо
    0.14
    adesh
    0.14
     vin
    0.14
    Bug
    0.14
     bug
    0.14
    Act Density 0.000%

    No Known Activations