INDEX
    Explanations

    adversaries and threats

    New Auto-Interp
    Negative Logits
     लीप
    0.76
    ').':
    0.71
     memoir
    0.71
     Solubility
    0.71
    UITableView
    0.71
    高等学校
    0.69
    ubility
    0.67
    Worked
    0.67
    Receipt
    0.67
    ReLU
    0.65
    POSITIVE LOGITS
     intruders
    2.56
     attackers
    2.56
     enemy
    2.35
     invaders
    2.35
     terrorists
    2.26
     malicious
    2.22
     enemies
    2.22
     adversaries
    2.20
     attacker
    2.17
     assailant
    2.16
    Act Density 0.582%

    No Known Activations