INDEX
Explanations
references to critical race theory and its applications in various contexts
New Auto-Interp
Negative Logits
kü
-0.17
crit
-0.17
criticisms
-0.16
Critics
-0.16
criticism
-0.16
criticised
-0.16
critics
-0.16
krit
-0.15
krit
-0.15
criticized
-0.15
POSITIVE LOGITS
ity
0.35
ITY
0.25
acclaim
0.25
-thinking
0.24
mass
0.24
thinking
0.23
Mass
0.21
jun
0.20
mass
0.20
thinker
0.19
Activations Density 0.015%