INDEX
    Explanations

    deceptive or manipulative tactics and strategies

    Negative Logits
    Token               Logit
    'PerformLayout'     -0.74
    'ThroughAttribute'  -0.68
    ' donation'         -0.61
    'endforeach'        -0.61
    'ãng'               -0.56
    ' NSCoder'          -0.56
    '="@+'              -0.55
    ' članak'           -0.54
    ' Matériau'         -0.54
    'arar'              -0.54
    Positive Logits
    Token          Logit
    ' tactics'     1.66
    ' tricks'      1.42
    ' tactic'      1.36
    ' strategies'  1.31
    ' Tactics'     1.29
    ' trick'       1.29
    ' strategy'    1.26
    ' cunning'     1.26
    'strategies'   1.18
    ' clever'      1.16
    Activation Density: 0.322%

    No Known Activations