INDEX
Explanations
phrases related to personal responsibility and mindset
themes related to responsibility and social issues
Negative Logits
ortium
-0.64
surprisingly
-0.54
utterstock
-0.49
berman
-0.49
Metropolitan
-0.49
Popular
-0.49
strikingly
-0.48
notable
-0.48
also
-0.47
Published
-0.46
Positive Logits
)."
1.00
…"
0.99
…"
0.94
?".
0.93
..."
0.92
!".
0.89
…."
0.86
."
0.86
..."
0.86
.")
0.86
Activations Density 1.233%