INDEX
Explanations
titles and references to books discussing public philosophy and moral questions
Negative Logits
inset    -0.07
unya     -0.06
ouro     -0.06
edly     -0.06
kers     -0.06
illon    -0.06
emiz     -0.06
azon     -0.06
dato     -0.06
Recap    -0.05
Positive Logits
alink          0.08
Contains       0.07
全面           0.07
resents        0.07
WithContext    0.07
This           0.06
<j             0.06
Contains       0.06
INCLUDE        0.06
978            0.06
Activations Density 0.015%