INDEX
Explanations
terms related to the concept of freedom
references to the concept of freedom in various contexts
New Auto-Interp
Negative Logits
Takeru
-0.85
sidx
-0.80
therap
-0.77
URRENT
-0.72
amel
-0.71
senal
-0.69
older
-0.69
ulous
-0.69
ENTION
-0.68
Archdemon
-0.67
POSITIVE LOGITS
bies
1.29
zers
1.00
bie
1.00
boot
0.98
roam
0.97
zing
0.93
edom
0.89
bern
0.84
zes
0.83
holders
0.78
Activations Density 0.038%