INDEX
    NEGATIVE LOGITS
    "PsyNetMessage"   -0.73
    "Constructed"     -0.65
    " Suc"            -0.64
    " autos"          -0.63
    " IMAGES"         -0.63
    " recip"          -0.63
    "eers"            -0.60
    " Mub"            -0.58
    "orum"            -0.57
    " Niet"           -0.57
    POSITIVE LOGITS
    "canon"       1.24
    "liner"       1.24
    "quarter"     1.23
    "liners"      1.22
    "lining"      1.21
    "quarters"    1.18
    "hun"         1.16
    "quartered"   1.15
    "butt"        1.15
    "gear"        1.14
    Activation Density 0.405%

    No Known Activations