INDEX
Explanations
words related to punishment or retribution
references to punishment and its implications
New Auto-Interp
Negative Logits
sonian
-0.91
coe
-0.90
gow
-0.90
soDeliveryDate
-0.80
roots
-0.76
ophe
-0.74
zyme
-0.73
ortment
-0.72
ouf
-0.71
eds
-0.71
POSITIVE LOGITS
harshly
0.92
punished
0.91
punishments
0.86
inflicted
0.86
punishment
0.83
severely
0.81
punish
0.79
tant
0.76
vanquished
0.73
humiliation
0.73
Activations Density 0.033%