INDEX
Explanations
terms related to institutions or institutionalization, particularly emphasizing negative connotations
references to institutionalization and systemic issues
New Auto-Interp
Negative Logits
kin
-0.75
vous
-0.74
manship
-0.74
bane
-0.74
nen
-0.74
spell
-0.73
llan
-0.72
cius
-0.72
Woman
-0.72
hammad
-0.70
POSITIVE LOGITS
ized
1.36
ised
1.16
ization
1.11
izational
1.08
izable
1.01
izing
0.99
isations
0.99
inertia
0.98
izations
0.97
racism
0.97
Activations Density 0.084%