INDEX
    Explanations

    words related to minimizing, reducing, or preventing

    actions or concepts associated with reducing negative impacts or minimizing risks

    Negative Logits
    "ker"               -0.87
    "enegger"           -0.85
    "king"              -0.81
    "swick"             -0.76
    "otle"              -0.75
    "gob"               -0.74
    "join"              -0.73
    "cart"              -0.71
    "leader"            -0.69
    "worldly"           -0.67
    Positive Logits
    "imize"             0.92
    " minimizing"       0.82
    " minimize"         0.80
    " distractions"     0.78
    " amounts"          0.77
    " maximizing"       0.77
    " minimized"        0.71
    " misunderstand"    0.71
    " utilization"      0.71
    "imal"              0.71
    Activation Density: 0.034%

    No Known Activations