INDEX
    NEGATIVE LOGITS
    html    -0.07
     Payne    -0.06
    .money    -0.06
    .activity    -0.06
    .xaml    -0.06
    HTML    -0.06
    GetInstance    -0.06
    -0.06
    Na    -0.06
    _feature    -0.06
    POSITIVE LOGITS
    bare    0.07
     misdemean    0.06
     ztr    0.06
     momentarily    0.06
     شهری    0.06
     něm    0.06
    ¼    0.06
    endez    0.06
     CharSet    0.06
     غير    0.06
    Activation Density: 0.010%

    No Known Activations