INDEX
    Explanations

    words related to actions and initiatives towards creating change or solving problems

    phrases related to actions or plans aimed at improvement and assistance

    Negative Logits
    Token         Logit
    " Typ"        -0.86
    " Printed"    -0.84
    " Writ"       -0.79
    "pict"        -0.74
    "Strange"     -0.73
    "prints"      -0.72
    "videos"      -0.70
    " Proud"      -0.70
    " Kubrick"    -0.70
    " Sour"       -0.69
    Positive Logits
    Token           Logit
    " improve"      1.66
    " mitigate"     1.65
    " reduce"       1.64
    " alleviate"    1.60
    " strengthen"   1.56
    " stabilize"    1.55
    " avert"        1.54
    " curb"         1.52
    " stimulate"    1.50
    " lessen"       1.47
    Activation Density: 0.286%

    No Known Activations