INDEX
    Explanations

    words related to checking or ensuring something is correct

    New Auto-Interp
    Negative Logits
     Ruin
    -0.82
    NetMessage
    -0.69
    ulia
    -0.69
    laughter
    -0.68
    >>>>>>>>
    -0.67
    gling
    -0.67
     havoc
    -0.66
     accuse
    -0.65
     woes
    -0.64
     ado
    -0.63
    POSITIVE LOGITS
     able
    1.36
     respectful
    1.24
     compliant
    1.24
     accessible
    1.22
     aware
    1.21
     comfortable
    1.20
     safe
    1.19
     sufficiently
    1.18
     inclusive
    1.17
     resilient
    1.15
    Act Density 0.361%

    No Known Activations