INDEX
Explanations
phrases related to empowerment and gender issues
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.06
3:0.07
4:0.17
5:0.05
6:0.07
7:0.24
8:0.04
9:0.06
10:0.08
11:0.06
Negative Logits
dyl
-1.59
ming
-1.52
ensibly
-1.51
aten
-1.46
DragonMagazine
-1.45
signific
-1.43
Reverend
-1.42
Technician
-1.37
semb
-1.33
millenn
-1.33
POSITIVE LOGITS
elight
1.47
ORPG
1.41
ById
1.39
Awards
1.36
hardest
1.34
FOIA
1.34
Db
1.31
Article
1.31
inbox
1.30
challenge
1.29
Activations Density 0.001%