INDEX
    Explanations

    phrases related to personal responsibility and mindset

    themes related to responsibility and social issues

    New Auto-Interp
    Negative Logits
    ortium
    -0.64
    surprisingly
    -0.54
    utterstock
    -0.49
    berman
    -0.49
     Metropolitan
    -0.49
     Popular
    -0.49
     strikingly
    -0.48
     notable
    -0.48
    also
    -0.47
    Published
    -0.46
    POSITIVE LOGITS
    )."
    1.00
     â̦"
    0.99
    â̦"
    0.94
    ?".
    0.93
     ..."
    0.92
    !".
    0.89
    â̦."
    0.86
    ."
    0.86
    ..."
    0.86
    .")
    0.86
    Act Density 1.233%

    No Known Activations