Explanations
words related to blood
references to blood and related themes of violence or sacrifice
Negative Logits
Token        Logit
awaru        -0.90
Amend        -0.82
IX           -0.76
Lank         -0.71
OPLE         -0.70
acle         -0.68
merce        -0.68
VICE         -0.67
srfAttach    -0.67
Spac         -0.66
Positive Logits
Token        Logit
thirst       1.49
bath         1.31
hound        1.31
stained      1.26
lust         1.17
shed         1.12
lines        1.01
spl          0.91
thirsty      0.90
shot         0.90
Activations Density: 0.027%