INDEX
Explanations
references to legal and human rights concepts such as freedom of expression and freedom of information
phrases related to freedom, especially in contexts of expression and information
New Auto-Interp
Negative Logits
UF
-0.85
forth
-0.71
ALD
-0.70
hoe
-0.69
è£
-0.69
iour
-0.68
soDeliveryDate
-0.67
ãĤ©
-0.67
ATIONAL
-0.65
neau
-0.64
POSITIVE LOGITS
navigation
0.85
expression
0.84
speech
0.83
choice
0.82
expression
0.81
inquiry
0.79
Expression
0.76
conscience
0.76
association
0.72
speech
0.70
Activations Density 0.033%