INDEX
    Explanations

    instances of the word "avoid" along with the value 9 or 10, indicating an emphasis on caution or prevention

    instances of the word "avoid."

    New Auto-Interp
    Negative Logits
    iop
    -0.90
    geist
    -0.69
    essee
    -0.68
    Rated
    -0.67
    cart
    -0.66
    ART
    -0.64
    otle
    -0.63
    opter
    -0.63
    Ready
    -0.62
     Directorate
    -0.62
    POSITIVE LOGITS
     detection
    0.78
    ably
    0.75
     pitfalls
    0.74
     wasting
    0.72
    ading
    0.68
     evade
    0.67
    avoid
    0.67
     avoidance
    0.66
    nels
    0.66
    azaki
    0.65
    Act Density 0.026%

    No Known Activations