INDEX
Explanations
references to influential women and their contributions or societal roles
Negative Logits
Token                 Logit
\<^                   -0.15
enos                  -0.15
.ISupportInitialize   -0.15
stab                  -0.14
ιακ                   -0.14
ικα                   -0.14
:".$                  -0.13
enis                  -0.13
iteli                 -0.13
ÏĦει                  -0.13
Positive Logits
Token                 Logit
;                     0.18
);                    0.18
ï¼īãĢģ                0.17
”;                    0.17
;↵                    0.17
nor                   0.16
)ãĢģ                  0.16
);                    0.16
[];                   0.15
];                    0.15
Activations Density 0.641%