INDEX
Explanations
phrases related to civil liberties and rights
references to civil liberties and freedoms
Negative Logits
HEAD        -0.74
ded         -0.72
acs         -0.72
Bi          -0.67
Hyper       -0.67
mentioned   -0.67
ripp        -0.65
ANN         -0.65
NC          -0.65
ptive       -0.64
Positive Logits
liberties    1.46
Liberties    1.23
freedoms     1.18
ktop         1.02
rights       0.91
Rights       0.89
liberty      0.88
ervative     0.85
ervatives    0.80
safeguards   0.78
Activations Density 0.003%