INDEX
Explanations
words related to men's issues
references to men's issues or topics related to men's rights
New Auto-Interp
Negative Logits
yz
-0.72
PN
-0.62
GMT
-0.59
ozy
-0.59
milo
-0.59
sky
-0.58
ei
-0.58
Osc
-0.58
quickShipAvailable
-0.57
Prim
-0.57
POSITIVE LOGITS
selves
1.05
lightly
0.92
aying
0.91
ourced
0.87
wered
0.86
atisf
0.86
avior
0.86
ayers
0.84
aunders
0.82
omew
0.80
Activations Density 0.169%