INDEX
Explanations
references to philosophical concepts or figures, particularly in relation to Objectivism and value systems
New Auto-Interp
Negative Logits
elu
-0.15
šku
-0.15
mlink
-0.14
ucene
-0.14
ä»·
-0.13
ucci
-0.13
ONO
-0.13
Balance
-0.13
Vladim
-0.13
batis
-0.13
POSITIVE LOGITS
Rand
0.21
/rand
0.19
Atlas
0.19
peaceful
0.19
Hay
0.18
usta
0.18
Atlas
0.17
Roth
0.17
Rand
0.17
Ludwig
0.17
Activations Density 0.026%