INDEX
Explanations
thematic elements related to moral and ethical dilemmas
New Auto-Interp
Negative Logits
enberg
-0.17
Integral
-0.16
antal
-0.14
rie
-0.14
rias
-0.14
DAT
-0.14
ahoo
-0.14
ria
-0.14
Zuk
-0.13
Ïģι
-0.13
POSITIVE LOGITS
environ
0.19
dest
0.18
sensible
0.17
act
0.17
conscious
0.16
minded
0.16
disposed
0.16
benefited
0.16
sunk
0.16
convers
0.15
Activations Density 0.130%