INDEX
Explanations
references to journalists and press freedom issues
Negative Logits
Token          Logit
rac            -0.15
orsch          -0.15
Merchant       -0.14
interracial    -0.14
racial         -0.14
yon            -0.14
opoulos        -0.14
race           -0.13
uo             -0.13
Treaty         -0.13
Positive Logits
Token          Logit
Freedom        0.26
freedom        0.25
Freedom        0.24
своб           0.22
_fre           0.21
Freem          0.19
freel          0.18
fre            0.18
fre            0.18
freedoms       0.18
Activations Density 0.019%