INDEX
    Explanations

    instances where the text presents a counterpoint or contrast to an idea

    the conjunction "but," often used to introduce contrasting ideas or exceptions

    Negative Logits
    "uto"       -0.68
    "irl"       -0.65
    "amus"      -0.64
    "Times"     -0.63
    "edu"       -0.62
    "ige"       -0.62
    "awan"      -0.62
    "ories"     -0.61
    "hal"       -0.60
    "leck"      -0.59
    Positive Logits
    " alas"             1.26
    " nevertheless"     1.25
    " nonetheless"      1.16
    " fortunately"      1.06
    " luckily"          0.99
    "tons"              0.93
    "chery"             0.92
    " ultimately"       0.92
    " hey"              0.91
    " unfortunately"    0.86
    Act Density 0.190%
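
The source does not define the density figure above. One common reading, assumed here, is the fraction of token positions on which the feature's activation is strictly positive. The sketch below illustrates that reading only; the function name `activation_density` and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations):
    """Return the fraction of token positions with a strictly positive activation.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Hypothetical data: 19 active positions out of 10,000 tokens.
acts = [0.0] * 9981 + [1.5] * 19
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.190% would mean the feature fires on roughly 19 of every 10,000 tokens.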

    No Known Activations