INDEX
Explanations
expectations and societal standards placed on women
New Auto-Interp
Negative Logits
Loud
-0.14
grit
-0.14
imperson
-0.14
loud
-0.14
energetic
-0.14
avern
-0.13
professionalism
-0.13
BirleÅŁik
-0.13
IMessage
-0.13
energ
-0.13
POSITIVE LOGITS
soft
0.43
soft
0.39
Soft
0.38
gent
0.37
Soft
0.36
gentle
0.35
softer
0.33
delicate
0.32
tender
0.31
gent
0.31
Activations Density 0.413%