INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Majesty
    -0.77
     Emin
    -0.71
    REF
    -0.66
     Subst
    -0.65
    aceae
    -0.64
     ¶
    -0.64
    RET
    -0.64
     Frankfurt
    -0.63
    İĭ
    -0.61
    multipl
    -0.61
    POSITIVE LOGITS
     charism
    0.72
    usat
    0.68
     favorites
    0.65
    hani
    0.61
     captcha
    0.61
    govtrack
    0.60
    iframe
    0.57
     politician
    0.57
    WP
    0.56
    ernandez
    0.55
    Act Density 0.114%

    No Known Activations