INDEX
Explanations
mentions or discussions related to feminism
references to feminism and feminist theories
New Auto-Interp
Negative Logits
laus
-0.87
sight
-0.75
acca
-0.75
worldly
-0.74
INESS
-0.73
Lago
-0.70
rip
-0.67
mg
-0.66
shape
-0.66
seeing
-0.66
POSITIVE LOGITS
feminists
0.89
feminist
0.89
azi
0.86
feminism
0.84
Feminist
0.81
galitarian
0.75
andom
0.74
issance
0.70
osphere
0.70
udi
0.70
Activations Density 0.017%