Explanations
concepts and discussions related to morality and ethical standards
Negative Logits
Token            Logit
ویکیپدیا         -0.64
nozze            -0.63
UnknownFieldSet  -0.60
Herrmann         -0.60
瞩               -0.57
assailants       -0.57
Empereur         -0.56
erequisites      -0.56
<!--[            -0.55
rzeczyw          -0.55
Positive Logits
Token            Logit
moral            2.48
Moral            2.21
Moral            2.18
moral            2.13
morals           2.09
morality         2.03
ethics           2.00
ethical          1.93
morales          1.75
morally          1.74
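The page does not say how these logit values are produced. A common convention for feature dashboards is to project the feature's decoder direction through the model's unembedding matrix and rank the vocabulary by the resulting score. The sketch below illustrates that convention on toy data only; `logit_effects`, the two-dimensional vectors, and the three-token vocabulary are hypothetical and are not taken from this feature.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Score every vocabulary token by the dot product of the feature's
    decoder direction with that token's unembedding row, then sort from
    most positive to most negative.

    Assumption: `unembedding` holds one row per token in `vocab`, and each
    row has the same length as `decoder_direction`.
    """
    scores = []
    for token, row in zip(vocab, unembedding):
        score = sum(d * w for d, w in zip(decoder_direction, row))
        scores.append((token, score))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy inputs, chosen only to make the ranking easy to follow.
direction = [1.0, 0.0]
vocab = ["moral", "ethics", "nozze"]
unembed = [[2.0, 0.1], [1.5, -0.3], [-0.6, 0.4]]

ranked = logit_effects(direction, unembed, vocab)
top_positive = ranked[:2]        # tokens the feature promotes most
top_negative = ranked[::-1][:1]  # tokens the feature suppresses most
```

Under this reading, the positive list names the tokens the feature pushes the model toward when it fires, and the negative list names the tokens it pushes away from.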
Activations Density 0.163%
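Activation density is usually read as the fraction of sampled token positions on which the feature fires. The following is a minimal sketch under that assumption; `activation_density`, the zero threshold, and the toy activations are illustrative and do not reproduce how this page computed its figure.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds
    `threshold`. Assumption: one activation value per token position."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 16 of which are active.
toy_activations = [0.0] * 9984 + [1.5] * 16
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

A density well below 1% indicates a sparse feature that is active on only a small share of tokens.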