    Explanations

    punctuation and transitional descriptors in text

    Negative Logits
    Token           Logit
    " takže"        -0.62
    " sondern"      -0.61
    " joten"        -0.58
    "就知道"        -0.56
    " بلکه"         -0.55
    "piram"         -0.54
    " utan"         -0.54
    " והוא"         -0.53
    "はじめに"      -0.53
    "hésite"        -0.53

    Positive Logits
    Token           Logit
    " However"      1.76
    " however"      1.75
    "However"       1.56
    "however"       1.44
    " entanto"      1.14
    "Однако"        1.08
    "Cependant"     1.07
    " Однако"       1.05
    " Cependant"    1.02
    " tuttavia"     1.02
    Activation Density: 0.250%

    No Known Activations