INDEX
Explanations
terms related to social issues, especially those concerning gender identity and the practice of solitary confinement
terms related to identities and experiences of marginalized groups, particularly focusing on "cis" identities and the concept of solitary confinement
New Auto-Interp
Negative Logits
ingly
-0.97
Tycoon
-0.81
Seller
-0.77
ageddon
-0.72
eering
-0.72
Wife
-0.67
Races
-0.67
seller
-0.66
oing
-0.66
sung
-0.65
POSITIVE LOGITS
confinement
0.96
solitary
0.85
medi
0.76
ctor
0.75
cis
0.73
binary
0.73
char
0.71
tern
0.70
cer
0.70
cipl
0.69
Activations Density 0.018%