INDEX
Explanations
references to freedom-related terms
references to concepts of freedom and related rights
Negative Logits
batch      -0.74
ARY        -0.73
oxic       -0.70
opsis      -0.67
acent      -0.66
eval       -0.65
offic      -0.64
etter      -0.64
nursery    -0.63
arily      -0.63
Positive Logits
Freedom         4.07
Freedom         2.97
freedom         2.13
freedom         1.94
Liberty         1.83
Independence    1.72
freedoms        1.58
FOIA            1.51
liberty         1.43
FRE             1.35
Activations Density 0.019%