INDEX
Explanations
mentions of various types of worms and the word 'Worms' with a capital 'W'
references to worms and related terms
New Auto-Interp
Negative Logits
arresting
-0.73
owered
-0.72
ourt
-0.71
orney
-0.71
orically
-0.70
Palestin
-0.70
seiz
-0.70
ournal
-0.69
liction
-0.69
yles
-0.69
POSITIVE LOGITS
hole
1.36
worms
1.22
holes
1.15
worm
1.13
tail
1.13
roots
1.01
worms
0.91
fish
0.87
Worm
0.83
pool
0.82
Activations Density 0.041%