INDEX
    Explanations

    statements emphasizing or discussing responsibilities

    discussions about personal and social responsibilities

    New Auto-Interp
    Negative Logits
    vae
    -0.84
    arthed
    -0.76
    tering
    -0.70
    design
    -0.69
    atin
    -0.69
    ergy
    -0.69
    clude
    -0.68
    corn
    -0.68
    mpeg
    -0.67
    inton
    -0.64
    POSITIVE LOGITS
     responsibility
    0.96
     responsibilities
    0.94
     delegated
    0.87
     Responsibility
    0.83
    lessly
    0.82
    lessness
    0.76
     respons
    0.75
    axter
    0.74
     obligation
    0.74
     owed
    0.73
    Act Density 0.019%

    No Known Activations