INDEX
Explanations
content related to the employment and roles of women in the workplace
Preceding a list or bullet point
physical, exercise, and work
Negative Logits
Token   Logit
."]     -0.97
.",     -0.95
".      -0.88
.       -0.86
.’”     -0.85
.[/     -0.83
."</    -0.83
.”—     -0.82
.'"     -0.82
.";     -0.80
Positive Logits
Token   Logit
=       0.88
->      0.84
&       0.80
ppl     0.77
:       0.74
->      0.74
-->     0.72
-       0.72
+       0.71
->$     0.70
Activations Density 0.294%