INDEX
    Explanations

    textual symbols commonly used to convey emotion or tone

    references to self and personal responsibility

    New Auto-Interp
    Negative Logits
     adolesc
    -0.70
     Broadcasting
    -0.70
     Rosenthal
    -0.70
    ickets
    -0.67
     Trinidad
    -0.66
     CLR
    -0.65
     Avalon
    -0.64
     friction
    -0.64
     guiActiveUnfocused
    -0.64
     paperback
    -0.64
    POSITIVE LOGITS
    own
    1.08
    should
    1.00
    must
    0.98
    agree
    0.97
    swer
    0.96
    ï¸ı
    0.96
     deserve
    0.92
    ve
    0.91
    tal
    0.90
    audi
    0.89
    Act Density 0.159%

    No Known Activations