INDEX
Explanations
concepts related to freedom, particularly in political and economic contexts
New Auto-Interp
Negative Logits
arians
-0.16
lou
-0.15
riad
-0.15
.LENGTH
-0.15
py
-0.15
elastic
-0.14
stem
-0.14
ous
-0.14
lio
-0.14
Ñģп
-0.14
POSITIVE LOGITS
bies
0.29
bie
0.29
bsd
0.28
-standing
0.24
zing
0.23
zes
0.23
-floating
0.23
/free
0.22
zers
0.22
zer
0.21
Activations Density 0.049%