INDEX
Explanations
references to horror-related themes and elements in writing
horror stories and movies
New Auto-Interp
Negative Logits
Sam
-0.49
sam
-0.47
Sam
-0.47
SAM
-0.46
Twee
-0.46
SAM
-0.45
Ap
-0.44
Samuel
-0.44
Pipe
-0.44
Am
-0.44
POSITIVE LOGITS
horror
2.03
horror
1.94
Horror
1.91
Horror
1.83
horrors
1.30
horrified
0.98
HOR
0.87
horri
0.87
horrifying
0.86
Horowitz
0.85
Activations Density 0.004%