INDEX
Explanations
phrases related to suffering and pain
terms related to suffering and its impact on individuals and society
New Auto-Interp
Negative Logits
ioch
-0.78
afort
-0.75
ificantly
-0.73
sheet
-0.73
smoking
-0.72
Dub
-0.68
cryptoc
-0.66
leans
-0.65
ENC
-0.65
agall
-0.65
POSITIVE LOGITS
endured
1.18
inflicted
1.10
awaits
0.86
suffered
0.83
untold
0.80
Wage
0.79
plight
0.79
plag
0.78
miser
0.77
await
0.76
Activations Density 0.144%