INDEX
Explanations
phrases related to freedom of speech
phrases related to concepts of freedom and speech
New Auto-Interp
Negative Logits
govtrack
-0.73
UF
-0.73
iron
-0.68
è£
-0.66
soDeliveryDate
-0.65
hoe
-0.62
Nusra
-0.62
failed
-0.62
stakes
-0.61
esville
-0.61
POSITIVE LOGITS
speech
1.01
expression
1.00
association
0.94
navigation
0.94
choice
0.92
conscience
0.91
expression
0.90
Expression
0.89
Speech
0.87
religion
0.85
Activations Density 0.028%