    Explanations

    statements expressing belief, opinion, or personal perspective

    Negative Logits
    artney          -0.72
    elle            -0.67
    ption           -0.64
    fare            -0.62
     purportedly    -0.61
    bies            -0.59
    eller           -0.58
    ueless          -0.57
    lopp            -0.57
    iae             -0.57
    Positive Logits
    poke             0.80
     strongly        0.71
     myself          0.68
    ħ                0.68
     passionately    0.65
    ĸ                0.63
     fortunate       0.63
     personally      0.60
    §                0.60
    ļé               0.59
    Activation Density: 11.750%

    No Known Activations