INDEX
Explanations
references to inflicting pain and suffering, particularly concerning ethical dilemmas related to animals and human experiences
New Auto-Interp
Negative Logits
Hochspringen
-0.54
connexes
-0.53
<bos>
-0.51
Effectiveness
-0.50
tagHelper
-0.49
referenties
-0.46
')['
-0.44
ganet
-0.44
Verhältnis
-0.44
seende
-0.43
POSITIVE LOGITS
misery
1.08
injury
1.03
pain
1.03
harm
1.02
illness
1.02
distress
0.99
sickness
0.97
unhappiness
0.97
hardship
0.96
disorder
0.95
Activations Density 1.014%