INDEX
    Explanations

    sentences indicating certainty or incredulity

    assertive statements or phrases that introduce certainty

    New Auto-Interp
    Negative Logits
    insula
    -0.82
    psey
    -0.79
    anwhile
    -0.72
    arthed
    -0.72
    ocene
    -0.71
    lasses
    -0.70
    aution
    -0.70
    entary
    -0.68
    uese
    -0.67
    NING
    -0.66
    POSITIVE LOGITS
     someday
    0.76
    è¦
    0.74
     irritated
    0.71
     deserved
    0.70
     offended
    0.69
     suffice
    0.65
    apo
    0.65
     ric
    0.65
     forgiven
    0.64
    founded
    0.64
    Act Density 0.023%

    No Known Activations