INDEX
    Explanations

    words starting with the prefix 'un'

    Negative Logits
    " briefs"      -0.75
    "OPLE"         -0.74
    " Tut"         -0.70
    " MX"          -0.69
    "hetti"        -0.67
    "anwhile"      -0.66
    " Madden"      -0.65
    " Blitz"       -0.64
    " Dynamics"    -0.63
    " Tackle"      -0.61
    Positive Logits
    "ruly"            1.22
    "balanced"        1.22
    "assuming"        1.20
    "cles"            1.18
    "earned"          1.17
    "ifying"          1.16
    "ipolar"          1.14
    "availability"    1.13
    "readable"        1.13
    "numbered"        1.12
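
The explanation "words starting with the prefix 'un'" can be checked informally against the positive-logit tokens listed above. The sketch below is only an illustration, not part of the dashboard: it assumes the feature fires on an "un" token and boosts likely continuations, and the helper name `with_un_prefix` is hypothetical. It prepends "un" to each token to show the words that would result.

```python
# Top positive-logit tokens, copied from the Positive Logits list above.
positive_tokens = [
    "ruly", "balanced", "assuming", "cles", "earned",
    "ifying", "ipolar", "availability", "readable", "numbered",
]


def with_un_prefix(tokens):
    """Prepend 'un' to each token (hypothetical helper, for illustration only)."""
    return ["un" + token for token in tokens]


# Assumption: the feature activates on "un" and promotes these continuations.
completions = with_un_prefix(positive_tokens)
for word in completions:
    print(word)
```

Each resulting string ("unruly", "unbalanced", "unassuming", and so on) is an English word, which is consistent with the explanation. The negative-logit tokens show no comparable pattern.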
    Activation Density: 0.796%

    No Known Activations