INDEX
Explanations
blood-related words and phrases
references to blood and its various associations
Negative Logits
awaru     -0.89
OPLE      -0.78
ECH       -0.76
IX        -0.75
VIDEOS    -0.74
Amend     -0.73
VICE      -0.72
ALS       -0.70
VERS      -0.68
Lank      -0.68
Positive Logits
thirst    1.36
bath      1.22
hound     1.20
stained   1.16
lust      1.11
shed      1.01
thirsty   0.95
wine      0.92
vessels   0.92
spilled   0.89
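Lists like the negative and positive logits above are commonly produced by projecting a feature's decoder direction onto the model's unembedding vectors and ranking the resulting per-token scores. The page does not say how its values were computed, so the following is only a minimal sketch under that assumption. The function name `top_logit_effects`, the placeholder tokens, and all vectors are hypothetical and chosen for illustration; none of the numbers come from this feature.

```python
# Hypothetical sketch: assumes logit effects are dot products between a
# feature's decoder direction and each token's unembedding vector.
def top_logit_effects(decoder_direction, unembedding_rows, vocab, k=2):
    """Return (top-k most positive, top-k most negative) token effects."""
    effects = []
    for token, row in zip(vocab, unembedding_rows):
        score = sum(d * u for d, u in zip(decoder_direction, row))
        effects.append((token, score))
    # Sort from most positive to most negative effect.
    ranked = sorted(effects, key=lambda pair: pair[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Toy inputs (invented for illustration, not taken from the page).
vocab = ["tok_a", "tok_b", "tok_c", "tok_d"]
unembedding_rows = [[1.0, 0.5], [0.5, 0.5], [-0.5, -0.5], [-1.0, 0.0]]
decoder_direction = [1.0, 0.5]

positive, negative = top_logit_effects(decoder_direction, unembedding_rows, vocab)
print("positive:", positive)
print("negative:", negative)
```

Under this reading, tokens such as "thirst" or "stained" would be those the feature pushes the model toward predicting, and the negative list those it pushes away from.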
Activations Density 0.017%
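Activation density is usually read as the share of token positions on which the feature fires at all, so 0.017% would correspond to roughly 17 active positions per 100,000 tokens. The sketch below assumes that definition (activation strictly greater than zero counts as firing); the function name `activation_density` and the toy input are hypothetical.

```python
# Hypothetical sketch: density as the percentage of positions with a
# positive activation. The definition is an assumption, not from the page.
def activation_density(activations):
    """Percentage of token positions where the activation is positive."""
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > 0)
    return 100.0 * active / len(activations)


# Toy input: 17 active positions out of 100,000.
toy_activations = [1.0] * 17 + [0.0] * 99_983
density = activation_density(toy_activations)
print(f"{density:.3f}%")
```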