INDEX
Explanations
terms related to physical or emotional pain
instances of the word "hurt."
New Auto-Interp
Negative Logits
DragonMagazine
-0.81
pires
-0.74
guyen
-0.73
ullivan
-0.73
iasco
-0.63
pedigree
-0.63
BuyableInstoreAndOnline
-0.62
velop
-0.61
Prospect
-0.60
ature
-0.60
POSITIVE LOGITS
ful
1.09
lessly
0.92
onies
0.92
fully
0.89
staking
0.86
feelings
0.86
ega
0.83
hurt
0.82
ted
0.82
ded
0.82
Activations Density 0.020%