INDEX
Explanations
topics and discussions related to race, racism, and racial identity
New Auto-Interp
Negative Logits
RegressionTest
-0.78
RenderAtEndOf
-0.77
ChildScrollView
-0.75
enumii
-0.73
SequentialGroup
-0.71
Rüyada
-0.69
+#+#
-0.69
<unused79>
-0.67
<unused14>
-0.67
<pad>
-0.67
POSITIVE LOGITS
gender
0.46
identity
0.39
issues
0.39
classification
0.38
matters
0.37
differences
0.33
identities
0.32
parties
0.32
status
0.32
concerns
0.32
Activations Density 0.066%