INDEX
Explanations
mentions of freedom of speech, expression, and related rights and issues
New Auto-Interp
Negative Logits
BALL
-0.67
CET
-0.66
tom
-0.64
VIS
-0.64
ARDS
-0.64
Gord
-0.63
Tele
-0.63
Stab
-0.63
cit
-0.62
git
-0.61
POSITIVE LOGITS
rights
1.07
protections
0.94
Rights
0.94
speech
0.93
freedoms
0.92
liberties
0.91
rights
0.89
freedom
0.83
advocates
0.83
pamph
0.83
Activations Density 0.063%