INDEX
    Explanations

    instances of the word "but" indicating contrasts or exceptions

    Negative Logits
    ConstraintMaker     -0.65
    LookAnd             -0.60
    ьаж                 -0.58
    brainly             -0.56
    下载附件            -0.56
    erializer           -0.52
     Valentina          -0.51
    itarianism          -0.51
    Rohan               -0.50
    ahon                -0.50
    Positive Logits
    numerusform         0.66
    KommentareTeilen    0.65
     Monfieur           0.65
    клопе               0.62
    říklad              0.59
     ſever              0.59
     internetowa        0.57
     Conſ               0.57
     خارجية             0.57
    liferay             0.57
    Activation Density: 0.008%

    No Known Activations