INDEX
    Negative Logits
    " oppress"          -0.07
    ".jump"             -0.07
    " HIV"              -0.07
    " occ"              -0.07
    " Preconditions"    -0.06
    " offence"          -0.06
    " preced"           -0.06
    "_plate"            -0.06
    " avent"            -0.06
    "inactive"          -0.06
    Positive Logits
    " er"               0.16
    "IGN"               0.06
    "ável"              0.06
    " Meteor"           0.06
    "AGAIN"             0.06
    " ENG"              0.06
    "aking"             0.06
    "excel"             0.06
    "ctr"               0.06
    "inals"             0.06
    Activation Density: 0.002%

    No Known Activations