INDEX
    Explanations

    mentions of health conditions, particularly malignancies

    terms related to malfunctions or negative conditions

    Negative Logits
    Token               Logit
    "BOOK"              -0.82
    "æĸ¹"               -0.74
    " FACE"             -0.72
    "ITED"              -0.72
    "DragonMagazine"    -0.71
    " Hobby"            -0.71
    "zzo"               -0.70
    " Carbuncle"        -0.69
    "hetti"             -0.69
    " Solitaire"        -0.68
    Positive Logits
    Token               Logit
    " mal"              1.15
    " vulner"           1.00
    "adies"             0.96
    "colm"              0.91
    "ignant"            0.88
    "ciating"           0.79
    "mal"               0.79
    " challeng"         0.79
    "practice"          0.78
    "querade"           0.78
    Activation Density: 0.008%
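
    To put the density figure in concrete terms, here is a minimal sketch, assuming that activation density means the fraction of token positions on which the feature has a nonzero activation. That definition, the function name `activation_density`, and the sample data are illustrative assumptions, not taken from this page.

```python
def activation_density(activations):
    """Fraction of token positions where the feature is active (> 0).

    Assumed definition, for illustration only.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 8 of them with a nonzero activation.
sample = [0.0] * 99_992 + [0.5] * 8
density = activation_density(sample)
print(f"{density:.3%}")  # 0.008%
```

    Under that assumption, a density of 0.008% corresponds to roughly 8 active positions per 100,000 tokens.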

    No Known Activations