INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Officials
    -0.07
     une
    -0.06
    vens
    -0.06
    /includes
    -0.06
     imkân
    -0.06
    -0.06
     homme
    -0.06
     Awesome
    -0.06
     Über
    -0.06
     Parents
    -0.06
    POSITIVE LOGITS
    usk
    0.07
    hash
    0.07
    kind
    0.06
     Yog
    0.06
    bindings
    0.06
    onus
    0.06
    []}
    0.06
    bmp
    0.06
    0.06
     biodiversity
    0.06
    Act Density 0.039%

    No Known Activations