INDEX
Explanations
terms related to ethics and morality
moral character and implications
New Auto-Interp
Negative Logits
GenerationType
-0.59
<%=
-0.58
Sklici
-0.55
FetchType
-0.54
="@+
-0.54
#+#
-0.53
ühungen
-0.52
Reverso
-0.51
Shuk
-0.49
Lieb
-0.49
POSITIVE LOGITS
moral
1.13
Moral
1.05
moral
1.01
Moral
0.99
morales
0.97
morals
0.95
morally
0.88
morality
0.84
ethics
0.80
ethical
0.79
Activations Density 0.077%