INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     XmlNode
    -0.07
    .publish
    -0.07
    ्बन
    -0.07
    conditional
    -0.07
     devoid
    -0.07
    ‌باشد
    -0.07
     protections
    -0.06
     İŞ
    -0.06
    Translatef
    -0.06
     WebClient
    -0.06
    POSITIVE LOGITS
    mile
    0.08
     amateur
    0.08
    Rh
    0.07
    ner
    0.07
     Beginners
    0.07
     novice
    0.07
     Gaw
    0.07
    _Ex
    0.07
     thi
    0.06
    lady
    0.06
    Act Density 0.010%

    No Known Activations