INDEX
    Explanations

    assertions or clarifications about certainty and clarity in statements

    clarification and correction idioms

    New Auto-Interp
    Negative Logits
    [--
    -0.48
     trattano
    -0.43
     ModelExpression
    -0.42
     OFDb
    -0.42
    expandindo
    -0.41
    adă
    -0.41
     Reverso
    -0.41
     câte
    -0.40
    当たり
    -0.40
    uride
    -0.40
    POSITIVE LOGITS
    jmniej
    0.56
     keineswegs
    0.55
     bukanlah
    0.54
    あくまで
    0.54
     bukan
    0.52
    remacy
    0.50
    決して
    0.48
     Bukan
    0.47
    AnchorStyles
    0.47
    enciaga
    0.47
    Act Density 0.037%

    No Known Activations