INDEX
    Explanations

    phrases containing the word "but"

    instances of the word "but" indicating contrasting statements or objections

    Negative Logits
    Token        Logit
    "pmwiki"     -0.77
    "famous"     -0.71
    " Written"   -0.65
    "fuck"       -0.65
    "catch"      -0.62
    " Merch"     -0.61
    "MY"         -0.61
    "ghost"      -0.60
    "Purchase"   -0.59
    "代"         -0.59
    Positive Logits
    Token              Logit
    " noting"          1.15
    " cautioned"       1.08
    " concedes"        1.03
    " stressing"       1.02
    " admits"          1.01
    " acknowledging"   0.95
    " acknowledges"    0.94
    " disagreed"       0.92
    " stressed"        0.91
    " nonetheless"     0.91
    Act Density 0.370%
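
    As a rough illustration of how an activation density figure like the one above can be read, the sketch below assumes it is the fraction of sampled tokens on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The page itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 37 of which activate the feature.
sample = [0.0] * 9963 + [1.2] * 37
density = activation_density(sample)
print(f"Act Density {density:.3%}")
```

    Under this reading, a density of 0.370% would mean the feature fires on roughly 37 of every 10,000 tokens.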

    No Known Activations