INDEX
Explanations
mentions of the name "Heath" and variations of the word "death."
Negative Logits
Zuk       -0.08
hoot      -0.07
iore      -0.07
andre     -0.07
erable    -0.06
haft      -0.06
arım      -0.06
gings     -0.06
venta     -0.06
udit      -0.06
Positive Logits
erton     0.09
umu       0.08
ley       0.08
emb       0.07
field     0.07
fulness   0.07
land      0.07
ivet      0.07
row       0.06
wood      0.06
Activations Density 0.004%