INDEX
    Explanations

    the word "but" and its variations to highlight contrasting ideas or exceptions

    New Auto-Interp
    Negative Logits
    eniz
    -0.07
     Kaynak
    -0.07
     kıl
    -0.07
    åĽ
    -0.07
     åĨĨ
    -0.07
     lại
    -0.07
    enaire
    -0.07
    Ìģt
    -0.07
    oti
    -0.07
    .dump
    -0.07
    POSITIVE LOGITS
     otherwise
    0.08
    Anyway
    0.08
     nevertheless
    0.07
     Anyway
    0.07
     still
    0.07
     basically
    0.07
     Still
    0.07
     fine
    0.07
     nonetheless
    0.06
     Otherwise
    0.06
    Act Density 0.037%

    No Known Activations