INDEX
Explanations
references to media freedom and the treatment of journalists
Negative Logits
opoulos     -0.19
μι          -0.15
resco       -0.15
ELS         -0.15
ccione      -0.14
Merchant    -0.14
nee         -0.14
NHS         -0.14
ادگی        -0.14
racial      -0.13
Positive Logits
freedom     0.27
CP          0.25
Freedom     0.25
CP          0.24
Freedom     0.23
своб        0.23
_fre        0.22
press       0.21
.cp         0.21
cp          0.21
Activations Density 0.014%