INDEX
Explanations
mentions of the name "Jack"
instances of the name "Jack."
New Auto-Interp
Negative Logits
enrichment
-0.68
laureate
-0.68
ndra
-0.68
PDATE
-0.66
brief
-0.63
EMBER
-0.61
renaissance
-0.61
isation
-0.61
disapprove
-0.60
SYSTEM
-0.60
POSITIVE LOGITS
knife
1.13
pots
1.06
Sparrow
1.05
pot
0.99
oway
0.91
intosh
0.91
Straw
0.90
fruit
0.88
hammer
0.86
Benny
0.83
Activations Density 0.026%