INDEX
Explanations
references to systemic issues or challenges faced by communities
Negative Logits
lico        -0.17
persecuted  -0.15
TORT        -0.15
urette      -0.15
Province    -0.15
Všech       -0.14
ç¾          -0.14
orks        -0.14
orrh        -0.14
etik        -0.14
Positive Logits
inner         0.33
neighborhood  0.31
community     0.28
crack         0.28
inner         0.28
housing       0.26
gang          0.26
African       0.25
block         0.24
Inner         0.24
Activations Density: 0.283%