INDEX
    Explanations

    words related to negative events or conditions

    references to negative or harmful characteristics and conditions

    New Auto-Interp
    Negative Logits
     Carbuncle
    -0.81
    æĸ¹
    -0.73
    ALK
    -0.72
    ORY
    -0.70
     Annotations
    -0.69
    BOOK
    -0.67
     FACE
    -0.66
     Authorization
    -0.66
     Polo
    -0.65
     Defenders
    -0.65
    POSITIVE LOGITS
    colm
    1.17
    ignant
    1.15
    adies
    1.14
    practice
    1.03
    formed
    1.02
    igned
    0.98
    arial
    0.97
    icious
    0.96
    ady
    0.94
    absor
    0.91
    Act Density 0.011%

    No Known Activations