INDEX
    Explanations

    adjectives describing negative characteristics

    terms related to unacknowledged or involuntary actions

    New Auto-Interp
    Negative Logits
    phrine
    -0.82
    anwhile
    -0.78
    hyde
    -0.76
    ĺħ
    -0.75
    onyms
    -0.73
    senal
    -0.72
     Defenders
    -0.71
    uyomi
    -0.71
    oples
    -0.69
    pmwiki
    -0.67
    POSITIVE LOGITS
    ritten
    1.01
    arranted
    0.98
    inding
    0.93
    ashed
    0.91
    ield
    0.89
    irth
    0.87
    avering
    0.85
    itt
    0.83
    ishable
    0.82
    atcher
    0.81
    Act Density 0.004%

    No Known Activations