INDEX
    Explanations

    instances of the word "but" or similar transitional phrases indicating contrast or exception

    Negative Logits
    " ravi"          -0.64
    " Waray"         -0.61
    "baya"           -0.60
    " Litu"          -0.60
    " Aire"          -0.60
    "necked"         -0.59
    "KeyName"        -0.56
    " linden"        -0.55
    " projective"    -0.55
    "voo"            -0.55

    Positive Logits
    " but"            3.60
    " But"            3.35
    "But"             3.15
    " pero"           3.07
    "but"             3.06
    " BUT"            2.88
    " nhưng"          2.68
    " tetapi"         2.61
                      2.60
    "BUT"             2.51
    Activation Density: 0.215%

    No Known Activations