Explanations
mentions or references to sparks and ignition
occurrences of the word "spark."
Negative Logits
ordan       -0.79
phia        -0.74
uphem       -0.70
guyen       -0.70
etheless    -0.70
terday      -0.66
hygiene     -0.66
parasites   -0.66
ibaba       -0.66
uthor       -0.66
Positive Logits
plug        1.08
lers        0.93
le          0.93
lest        0.91
lar         0.89
LES         0.88
lights      0.85
led         0.84
ling        0.84
ris         0.84
Activations Density: 0.014%