INDEX
Explanations
expressions related to struggle and social justice
Negative Logits
Impl         -0.15
ythe         -0.14
μι           -0.14
Mein         -0.14
otta         -0.13
moil         -0.13
ethnicity    -0.13
ethnic       -0.13
Nag          -0.13
éĤª          -0.13
Positive Logits
TL           0.16
Visibility   0.15
femme        0.15
gifs         0.15
GIF          0.15
erais        0.14
psc          0.14
Visibility   0.14
visibility   0.14
ostel        0.14
Activations Density: 0.609%