INDEX
    Explanations

    references to the concept of evil or malevolent behaviors

    New Auto-Interp
    Negative Logits
    ContentLoaded
    -0.61
    webElementXpaths
    -0.61
     voters
    -0.61
     Voter
    -0.61
    intios
    -0.60
    Personensuche
    -0.59
    enderror
    -0.59
    aclk
    -0.58
    findpost
    -0.58
    featureID
    -0.58
    POSITIVE LOGITS
     evil
    1.19
     demons
    0.92
     Evil
    0.91
     devils
    0.90
    evil
    0.89
    Evil
    0.89
     enemies
    0.89
     monsters
    0.88
     monster
    0.87
     demon
    0.83
    Act Density 0.123%

    No Known Activations