INDEX
    Explanations

    expressions of doubt or uncertainty

    New Auto-Interp
    Negative Logits
    etheus
    -0.76
    SPONSORED
    -0.74
    orthy
    -0.73
    merce
    -0.70
     Âł Âł
    -0.69
    anwhile
    -0.66
     undet
    -0.64
    eki
    -0.63
    aneously
    -0.61
    Result
    -0.61
    POSITIVE LOGITS
    soType
    0.69
     passionate
    0.69
    )</
    0.67
     sometimes
    0.65
     cried
    0.65
     cliché
    0.64
     someday
    0.64
     firsthand
    0.63
     sarcastic
    0.62
     sarc
    0.62
    Act Density 0.190%

    No Known Activations