INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
     pledge
    -0.08
    -0.08
     pubblic
    -0.07
     pall
    -0.07
    ाशी
    -0.07
     ponerse
    -0.07
    -0.07
     legitimacy
    -0.07
     поз
    -0.07
    POSITIVE LOGITS
    _COLUMN
    0.08
    NDER
    0.08
     Bien
    0.08
    _DOC
    0.08
    =headers
    0.08
    Bookmarks
    0.08
    Bem
    0.08
    OBS
    0.08
    REET
    0.08
    Nal
    0.07
    Act Density 0.000%

    No Known Activations