    Explanations

    phrases related to defending or defense

    instances of the word "defend" in various contexts

    Negative Logits
    Token           Logit
    "NetMessage"    -0.70
    " explode"      -0.69
    "hall"          -0.68
    "Ju"            -0.68
    "ucket"         -0.63
    "foot"          -0.63
    "Hop"           -0.63
    "Machine"       -0.62
    "mad"           -0.62
    "bows"          -0.62
    Positive Logits
    Token           Logit
    " against"      0.97
    "atively"       0.90
    " defending"    0.87
    " Against"      0.81
    " defends"      0.76
    "ively"         0.76
    "orate"         0.75
    "ably"          0.75
    "iveness"       0.74
    "ously"         0.72
    Act Density 0.026%

    No Known Activations