INDEX
Explanations
phrases related to ethical considerations and emotional distress
New Auto-Interp
Negative Logits
ạt
-0.15
leth
-0.15
azard
-0.15
handy
-0.14
faults
-0.14
disastrous
-0.14
Haz
-0.14
Koh
-0.14
deadly
-0.14
ubbles
-0.13
POSITIVE LOGITS
pain
0.44
Pain
0.36
pain
0.35
agony
0.28
misery
0.26
çĹĽ
0.25
suffering
0.25
Äijau
0.24
pains
0.24
dolore
0.23
Activations Density 0.169%