INDEX
    Explanations

    poetry-related content, possibly focusing on poems written about personal experiences or social issues

    New Auto-Interp
    Negative Logits
    owship
    -0.72
     narrator
    -0.70
     EDITION
    -0.68
    stood
    -0.67
    lain
    -0.65
    DERR
    -0.63
    ATIONAL
    -0.63
    worthiness
    -0.62
    UAL
    -0.62
    nings
    -0.61
    POSITIVE LOGITS
    pper
    1.32
    ppy
    1.24
    pping
    1.21
    etry
    1.16
    ppel
    1.14
    orer
    1.11
    achers
    1.11
    aching
    1.10
    inters
    1.10
    ppers
    1.09
    Act Density 0.029%

    No Known Activations