Explanations
themes related to societal issues and the impact of ideology on freedom and morality
Negative Logits
ugas        -0.20
antom       -0.16
omba        -0.15
oproject    -0.15
iddi        -0.14
quence      -0.14
redient     -0.14
ãĥĩãĥ«      -0.14
çľĭçľĭ      -0.14
ipi         -0.14
Positive Logits
isser       0.15
mani        0.14
iral        0.14
ISIBLE      0.14
inç         0.14
åĿĬ         0.14
ilk         0.14
.opt        0.14
صÙģ         0.14
nh          0.14
Activations Density 0.471%