INDEX
    Explanations

    reduce, minimize, decrease

    Negative Logits
    Token          Logit
    " pickups"     0.42
    " нара"        0.42
    " ***!"        0.42
    "oczy"         0.41
    " 확장"        0.40
                   0.39
    "मारा"         0.39
    "_{+}$"        0.39
    " Rookie"      0.39
    "加热"         0.39
    Positive Logits
    Token            Logit
    "Subtract"       0.92
    " subtract"      0.91
    " subtracted"    0.90
    " Subtract"      0.89
    " decrease"      0.88
    " subtracting"   0.87
    "減少"           0.84
    "decrease"       0.82
    " decreases"     0.81
    " subtraction"   0.80
    Act Density 0.175%

    No Known Activations