INDEX
    Explanations

    phrases related to opinions or evaluations

    sentences that express definitive or conclusive statements

    New Auto-Interp
    Negative Logits
     disemb
    -0.76
     nodd
    -0.72
     dep
    -0.69
     purse
    -0.67
    yip
    -0.67
    soDeliveryDate
    -0.67
     dips
    -0.66
     explan
    -0.66
     agent
    -0.65
    ogene
    -0.65
    POSITIVE LOGITS
     Eventually
    1.14
     Ultimately
    1.14
     Instead
    1.11
     Thankfully
    1.10
     Fortunately
    1.09
     Initially
    1.08
     Afterwards
    1.07
     Then
    1.06
     However
    1.06
     Unfortunately
    1.06
    Act Density 0.770%

    No Known Activations