INDEX
    Explanations

    the word "grab" or similar terms indicating physical seizing or taking hold of something

    New Auto-Interp
    Negative Logits
    AMY
    -0.77
    xual
    -0.74
    SPONSORED
    -0.65
    present
    -0.65
    MQ
    -0.64
    ema
    -0.64
    acre
    -0.64
     Prol
    -0.62
    ingen
    -0.62
    ————
    -0.61
    POSITIVE LOGITS
     onto
    0.98
    bable
    0.94
    bers
    0.93
    bing
    0.90
     hold
    0.90
    hold
    0.86
     glances
    0.80
    ber
    0.78
    reau
    0.75
    bage
    0.75
    Act Density 0.044%

    No Known Activations