INDEX
    Explanations

    language that expresses necessity or challenges associated with personal and social responsibilities

    New Auto-Interp
    Head Attr Weights
    0:0.05
    1:0.02
    2:0.05
    3:0.23
    4:0.02
    5:0.05
    6:0.01
    7:0.07
    8:0.02
    9:0.01
    10:0.38
    11:0.02
    Negative Logits
    Registration
    -2.34
     yesterday
    -2.22
    sylvania
    -2.20
     tomorrow
    -2.18
    soon
    -2.14
    today
    -2.12
     1886
    -2.09
     ASAP
    -2.07
    caliber
    -2.02
    ntil
    -2.02
    POSITIVE LOGITS
     mistakes
    3.21
     misunderstand
    3.07
     incorrectly
    2.82
     annoy
    2.78
     wrongly
    2.69
     misinterpret
    2.69
     unintentionally
    2.69
     inaccur
    2.63
     misunderstood
    2.61
     tragedies
    2.58
    Act Density 0.263%

    No Known Activations