INDEX
Explanations
references to demographic representation and diversity in a social context
New Auto-Interp
Negative Logits
/Instruction
-0.17
å²
-0.15
Hosp
-0.14
ëıħ
-0.14
Monitor
-0.14
ebay
-0.14
rame
-0.13
ulan
-0.13
zan
-0.13
ovie
-0.13
POSITIVE LOGITS
Science
0.34
scientists
0.33
STEM
0.33
science
0.33
scientist
0.31
Science
0.30
Scientist
0.30
Scientists
0.30
scientific
0.29
Scientific
0.28
Activations Density 0.011%