INDEX
    Explanations

    words and phrases related to harsh or challenging situations

    a specific visual or formatting pattern in the text

    New Auto-Interp
    Negative Logits
     oven
    -0.76
     nuts
    -0.76
    eering
    -0.72
     Lumpur
    -0.68
    orts
    -0.67
    egu
    -0.66
     precaution
    -0.65
     hemor
    -0.65
     proced
    -0.64
     palm
    -0.63
    POSITIVE LOGITS
    meaning
    1.10
    advertisement
    1.08
    along
    1.07
    perhaps
    1.06
    feat
    1.05
    particularly
    1.03
    among
    1.03
    something
    1.01
    which
    1.00
    they
    0.99
    Act Density 0.050%

    No Known Activations