INDEX
    Explanations

    capitalized terms including "Av" followed by a number

    New Auto-Interp
    Negative Logits
    hyde
    -0.65
    ptives
    -0.65
     respons
    -0.64
     inaccessible
    -0.63
    FORMATION
    -0.62
     Trouble
    -0.60
     sense
    -0.59
     Barnett
    -0.59
     handy
    -0.58
    utenant
    -0.58
    POSITIVE LOGITS
    ocado
    1.23
    atars
    1.23
    ril
    1.14
    iew
    1.07
    oided
    1.07
    ionics
    1.05
    iol
    1.05
    ille
    1.04
    iator
    1.03
    atar
    1.02
    Act Density 0.014%

    No Known Activations