INDEX
    Explanations

    phrases indicating restrictions, limitations, or disclaimers

    Negative Logits
    Token (quoted)        Logit
    " câteva"             -0.64
    " sometime"           -0.63
    " almeno"             -0.60
    " Slightly"           -0.60
    " slightly"           -0.59
    "SOME"                -0.58
    "slightly"            -0.58
    "MessageTagHelper"    -0.58
    "somewhat"            -0.58
    " puțin"              -0.57
    Positive Logits
    Token (quoted)        Logit
    "yet"                 0.76
    " yet"                0.74
    " YET"                0.64
    " Yet"                0.62
    "Yet"                 0.62
    " necessarily"        0.59
    " harmed"             0.59
    "contentLoaded"       0.58
    "necessarily"         0.56
    " exceed"             0.56
    Activation Density: 0.817%

    No Known Activations