INDEX
Explanations
references to Halloween activities and traditions
New Auto-Interp
Negative Logits
ante
-0.15
Fil
-0.15
copp
-0.15
egin
-0.14
FC
-0.14
essler
-0.14
surrogate
-0.14
skyt
-0.14
landa
-0.14
Fell
-0.14
POSITIVE LOGITS
Halloween
0.19
trick
0.17
Sugar
0.17
sugar
0.17
ç³ĸ
0.16
costumes
0.16
Trick
0.16
Door
0.15
tob
0.15
candy
0.15
Activations Density 0.025%