INDEX
Explanations
instances of the word "candy"
New Auto-Interp
Negative Logits
yon
-0.71
heed
-0.70
transcript
-0.70
chron
-0.68
semble
-0.64
iets
-0.63
Hutchinson
-0.62
Published
-0.61
inel
-0.61
plur
-0.60
POSITIVE LOGITS
cane
1.22
vine
0.93
bucks
0.93
strip
0.93
pill
0.92
candy
0.91
flake
0.90
Crush
0.88
bar
0.87
bowl
0.87
Activations Density 0.030%