    Explanations

    instances of the word "but" emphasizing contrast or transition in discussions

    Negative Logits
    ÄĻż            -0.15
    icer           -0.14
     maar          -0.14
    但是           -0.14
    ouver          -0.14
    yet            -0.14
    agment         -0.14
    phant          -0.13
    transparent    -0.13
    oor            -0.13
    Positive Logits
    cher           0.17
    tery           0.16
    ystack         0.16
     despite       0.16
    tk             0.15
    chie           0.15
    tern           0.14
     Appe          0.14
    lauf           0.14
    auer           0.14
    Activation Density: 0.087%

    No Known Activations