INDEX
    Explanations

    mentions of evil in various contexts

    instances and variations of the word "evil."

    New Auto-Interp
    Negative Logits
    raltar
    -0.78
    UNCH
    -0.76
    eding
    -0.74
    PsyNetMessage
    -0.74
    aro
    -0.73
    dropping
    -0.73
    kees
    -0.71
    akeru
    -0.70
    utra
    -0.68
    illation
    -0.68
    POSITIVE LOGITS
     incarn
    1.00
     deeds
    0.93
     deed
    0.90
     twin
    0.86
    nesses
    0.85
     genius
    0.84
     mastermind
    0.83
     lord
    0.79
     evil
    0.76
     empire
    0.75
    Act Density 0.027%

    No Known Activations