INDEX
Explanations
references to scholarship and academic discussions on social justice issues
New Auto-Interp
Negative Logits
963
-0.17
ensi
-0.17
inval
-0.15
ahy
-0.15
Ahmad
-0.14
redient
-0.14
ikhail
-0.14
Mrs
-0.14
adem
-0.13
oksen
-0.13
POSITIVE LOGITS
Prof
0.27
Professor
0.26
professor
0.26
soci
0.24
prof
0.24
Professor
0.23
PROF
0.23
Prof
0.22
prof
0.22
historian
0.22
Activations Density 0.248%