INDEX
Explanations
references to the concept of 'witch hunts'
references to witch hunts and witch-related terminology
New Auto-Interp
Negative Logits
phia
-0.77
IMAGES
-0.72
upon
-0.68
ais
-0.67
interrupted
-0.65
inished
-0.63
Fault
-0.63
IGH
-0.62
andro
-0.62
ateral
-0.61
POSITIVE LOGITS
doctor
1.22
hunt
0.93
sonian
0.91
haz
0.89
ry
0.89
ery
0.86
finder
0.83
tail
0.81
hattan
0.81
Hazel
0.79
Activations Density 0.049%