INDEX
Explanations
terms related to societal issues, such as poverty, violence, discrimination, and regulations
issues related to systemic inequality and social justice
Negative Logits
¬¼          -0.70
itone       -0.69
ģĸ          -0.64
ãĤ¦ãĤ¹      -0.64
quet        -0.63
atically    -0.63
itus        -0.61
lor         -0.60
identally   -0.59
ady         -0.59
Positive Logits
etc          0.95
namely       0.76
sed          0.76
including    0.75
albeit       0.75
Magikarp     0.69
but          0.67
whereas      0.66
which        0.64
advertising  0.63
Activations Density 0.273%