INDEX
Explanations
words related to freedom
phrases related to freedom and its various implications
New Auto-Interp
Negative Logits
ergy
-0.82
Cosponsors
-0.70
ded
-0.69
itant
-0.69
IFIC
-0.68
liam
-0.67
IENT
-0.67
recorded
-0.66
ENTS
-0.65
nas
-0.65
POSITIVE LOGITS
freedom
1.05
freedom
0.98
freedoms
0.98
bies
0.97
roam
0.91
captives
0.80
unrestricted
0.77
boats
0.76
emancipation
0.75
yip
0.74
Activations Density 0.019%