INDEX
Explanations
terms related to social justice and systemic inequalities
New Auto-Interp
Negative Logits
jacob
-0.70
fcntl
-0.69
thschild
-0.68
ت
-0.67
getB
-0.65
hwnd
-0.63
ண
-0.61
ילום
-0.60
т
-0.59
nought
-0.59
POSITIVE LOGITS
ized
1.34
ization
1.26
izations
1.23
urized
1.12
ize
1.11
IZED
1.07
Travelers
1.05
izing
1.03
ilized
1.02
IZATION
1.01
Activations Density 0.474%