INDEX
Explanations
significant mentions of political representation and diversity
New Auto-Interp
Negative Logits
฿
-0.16
fin
-0.14
illin
-0.14
provinces
-0.14
ÑĢÑı
-0.14
verity
-0.14
_DECLARE
-0.13
elda
-0.13
ØŃت
-0.13
llum
-0.13
POSITIVE LOGITS
immigrant
0.31
immigrants
0.28
immigration
0.28
ethnic
0.27
Immigration
0.27
Ethnic
0.27
-imm
0.25
immigr
0.25
dias
0.24
ethnic
0.24
Activations Density 0.679%