INDEX
Explanations
instances of the word "Hell" mentioned in the text
references to the word "Hell."
Negative Logits
Token         Logit
Random        -0.69
sshd          -0.68
Decre         -0.67
Dex           -0.66
Surveillance  -0.63
Borders       -0.62
サーティワン  -0.61
Gly           -0.60
Recomm        -0.60
York          -0.60
Positive Logits
Token         Logit
enic          1.22
hound         1.13
ishly         1.11
bent          1.03
fire          1.01
bender        0.98
ish           0.94
ibur          0.93
rieg          0.93
spawn         0.90
Activations Density 0.026%