INDEX
Explanations
references to existential themes and philosophical insights, particularly those related to Nietzsche's ideas
New Auto-Interp
Negative Logits
sie
-0.17
ey
-0.17
SCAN
-0.15
romance
-0.15
acob
-0.15
_SR
-0.15
elson
-0.15
kie
-0.14
_MR
-0.14
oken
-0.14
POSITIVE LOGITS
Zar
0.24
Superman
0.20
herd
0.19
Nietzsche
0.18
Niet
0.18
Wagner
0.17
Dion
0.17
Masks
0.16
Pars
0.16
masks
0.15
Activations Density 0.034%