Explanations
references to media and press freedom, particularly related to journalism
Negative Logits
rac       -0.15
opat      -0.15
orsch     -0.14
resco     -0.14
allax     -0.14
nee       -0.14
iov       -0.14
racial    -0.14
yon       -0.14
overe     -0.13
Positive Logits
Freedom       0.26
freedom       0.25
Freedom       0.23
Journalism    0.23
freel         0.22
своб          0.22
journalists   0.22
journalism    0.21
_fre          0.20
Freel         0.20
Activations Density 0.023%