INDEX
Explanations
references to witches or witchcraft
references to witches and witchcraft
New Auto-Interp
Negative Logits
egal
-0.83
inished
-0.79
ournal
-0.74
ributed
-0.74
served
-0.72
rained
-0.72
upon
-0.68
rez
-0.66
jri
-0.66
haar
-0.66
POSITIVE LOGITS
Witch
1.00
doctor
0.95
witch
0.86
hunts
0.81
haz
0.81
Hazel
0.79
witch
0.78
CLSID
0.76
ERY
0.76
ãĥīãĥ©
0.75
Activations Density 0.011%