Explanations
specific mentions of vampires
references to vampires and vampire-related media
Negative Logits
Token       Logit
giving      -0.78
beh         -0.77
placed      -0.74
essler      -0.73
ear         -0.73
abis        -0.71
orative     -0.70
olulu       -0.69
rontal      -0.69
ional       -0.68
Positive Logits
Token       Logit
Vampire      1.19
vampires     1.05
vampire      1.01
ampire       0.98
Dracula      0.95
Masquerade   0.94
squid        0.93
wolves       0.83
ampires      0.83
Slayer       0.82
Activations Density 0.009%