INDEX
    Explanations

    expressions of uncertainty or subjectivity about situations

    Negative Logits
    ests          -0.82
    rouse         -0.78
    orem          -0.75
    otos          -0.75
    andise        -0.74
    ilts          -0.71
    pez           -0.69
    izons         -0.68
    perature      -0.68
    venge         -0.68

    Positive Logits
     unlikely      0.93
     doubtful      0.91
     probable      0.78
     unclear       0.75
     prudent       0.72
     plausible     0.71
     advisable     0.70
     folly         0.70
     feasible      0.70
     imperative    0.69

    Activation Density: 0.035%

    No Known Activations