INDEX
    Explanations

    phrases related to planning and carrying out harmful actions or schemes

    phrases related to plotting or planning violent acts or crimes

    New Auto-Interp
    Negative Logits
    framework
    -0.73
    imentary
    -0.72
     ðŁij
    -0.72
     Quartz
    -0.70
    required
    -0.68
     partName
    -0.67
    HUD
    -0.66
    ä¸Ĭ
    -0.65
    facing
    -0.65
    reflect
    -0.64
    POSITIVE LOGITS
     terrorist
    1.26
     murder
    1.25
     murders
    1.24
     terror
    1.24
     mayhem
    1.23
     terrorism
    1.22
     massacres
    1.20
     destruction
    1.19
     attacks
    1.19
     atrocities
    1.19
    Act Density 0.324%

    No Known Activations