INDEX
    Explanations

    verbs related to actions of decrease or reduction

    instances of the word "reduce" and its variations, indicating a focus on minimizing or lowering something

    NEGATIVE LOGITS
    Token      Logit
    "Found"    -0.71
    "Truth"    -0.64
    "REL"      -0.64
    "Lord"     -0.62
    "¯¯"       -0.62
    " Phant"   -0.60
    "shake"    -0.60
    " Saud"    -0.59
    "old"      -0.59
    "place"    -0.59
    POSITIVE LOGITS
    Token           Logit
    " inhib"        0.87
    " visibility"   0.78
    "icides"        0.77
    "uce"           0.77
    "escal"         0.76
    "ibrary"        0.75
    " emissions"    0.75
    " carbohyd"     0.75
    "anguage"       0.75
    " greenhouse"   0.74
    Activation Density: 0.049%

    No Known Activations