INDEX
Explanations
the concept of freedom
concepts related to freedom and expression
New Auto-Interp
Negative Logits
ergy
-0.85
IFIC
-0.77
amac
-0.74
itant
-0.72
Dynasty
-0.69
assium
-0.67
IENT
-0.66
liam
-0.66
recorded
-0.64
athan
-0.63
POSITIVE LOGITS
freedom
1.06
roam
0.98
freedom
0.96
freedoms
0.96
bies
0.92
captives
0.82
prisoners
0.78
unrestricted
0.77
prisoner
0.76
communism
0.74
Activations Density 0.021%