INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Mosul
    -0.07
     gồm
    -0.06
     kitty
    -0.06
    nictvím
    -0.06
     lần
    -0.06
     поход
    -0.06
     AFF
    -0.06
     Collect
    -0.06
     heures
    -0.06
    (mesh
    -0.06
    POSITIVE LOGITS
     encouragement
    0.07
     heraus
    0.07
     приклад
    0.07
    ิว
    0.06
     Nationals
    0.06
    T
    0.06
    0.06
     مثبت
    0.06
    -byte
    0.06
    HEL
    0.06
    Act Density 0.003%

    No Known Activations