INDEX
Explanations
references to racial and ethnic disparities in healthcare access and experiences.
information related to medical conditions and demographics, especially focusing on racial and ethnic disparities.
This neuron appears to detect comparisons or contrasts between racial/ethnic groups, particularly focusing on mentions of white people compared to other racial minorities. It shows high activations for phrases that juxtapose different racial groups or discuss racial demographics and disparities. However, I want to note that
the word "white"
the word white
New Auto-Interp
Negative Logits
Jeografia
-0.81
Tikang
-0.64
CreateTagHelper
-0.63
fubject
-0.60
femininas
-0.59
featureID
-0.58
kysy
-0.58
ſeveral
-0.57
sogget
-0.57
ohjel
-0.57
POSITIVE LOGITS
white
2.15
white
1.85
White
1.81
White
1.72
WHITE
1.66
whites
1.50
WHITE
1.45
白
1.40
Whites
1.31
whites
1.28
Activations Density 0.336%