INDEX
Explanations
mentions of the word "ghost" with varying levels of emphasis
references to ghosts
Negative Logits
erity     -0.84
unes      -0.74
ibaba     -0.72
ighed     -0.70
une       -0.68
itsch     -0.68
undred    -0.67
NAD       -0.66
udeau     -0.66
verson    -0.66
Positive Logits
ghost      1.07
busters    1.06
ghost      0.97
buster     0.91
glass      0.90
writer     0.90
ghosts     0.85
writing    0.84
Haunted    0.83
Bunny      0.82
Activations Density: 0.010%