INDEX
Explanations
phrases related to diversity and multiculturalism
Negative Logits
063          -0.14
داÙħ         -0.14
uffy         -0.14
.flash       -0.14
بÙĪØ§Ø³Ø·Ø©   -0.14
ils          -0.14
scaleY       -0.13
ursive       -0.13
imentary     -0.13
ucha         -0.13
Positive Logits
diversity    0.27
foreign      0.26
Diversity    0.24
foreign      0.24
Foreign      0.22
Foreign      0.22
_foreign     0.21
global       0.20
divers       0.19
-global      0.19
Activations Density: 0.292%