INDEX
    Explanations

    words related to expressing opinions or making requests

    terms related to formal discourse and expressions of opinion

    New Auto-Interp
    Negative Logits
    orks
    -0.71
     Saud
    -0.64
    iliated
    -0.64
    ÃŁ
    -0.63
    ecause
    -0.62
    ccording
    -0.62
    uum
    -0.62
    respect
    -0.61
     airflow
    -0.60
    glomer
    -0.59
    POSITIVE LOGITS
    fulness
    0.85
     spree
    0.78
     regarding
    0.71
     exploits
    0.70
     foray
    0.69
     stance
    0.69
    iques
    0.68
     rampage
    0.66
    lessness
    0.65
     crusade
    0.64
    Act Density 0.314%

    No Known Activations