    Negative Logits
    Token            Logit
    " Became"        -0.61
    "Limited"        -0.60
    "ointed"         -0.59
    "fw"             -0.58
    "whe"            -0.58
    "bit"            -0.58
    "iverse"         -0.55
    "awar"           -0.55
    "Premium"        -0.55
    "iol"            -0.54
    Positive Logits
    Token            Logit
    " usual"         1.17
    " previous"      1.15
    " predecessors"  1.10
    " traditional"   1.06
    " typical"       1.04
    " counterparts"  1.03
    " conventional"  0.99
    "usual"          0.99
    " others"        0.95
    " predecessor"   0.89
    Activation Density: 2.236%

    No Known Activations