INDEX
Explanations
mentions of political scandals and investigations
New Auto-Interp
Negative Logits
traseiro
-0.71
móvel
-0.62
semelh
-0.60
vermelhas
-0.59
femininos
-0.58
dianteiro
-0.57
abstrato
-0.56
gouttes
-0.55
brancas
-0.54
libremente
-0.54
POSITIVE LOGITS
Brazilian
1.01
Brazil
0.95
Brazilian
0.88
Brazil
0.87
brazilian
0.82
Brasil
0.82
São
0.81
Braz
0.79
BRAZIL
0.79
BRL
0.77
Activations Density 0.277%