INDEX
Explanations
mentions of the word "sugar."
references to sugar and its effects
Negative Logits
ership        -0.81
naire         -0.79
acular        -0.78
FML           -0.75
agame         -0.73
atche         -0.73
ļé            -0.73
Downloadha    -0.71
ĪĴ            -0.70
atform        -0.70
Positive Logits
beet          1.10
syrup         1.08
cane          1.07
daddy         0.86
sugar         0.85
coat          0.78
coating       0.78
mell          0.77
moon          0.76
water         0.76
Activation Density: 0.018%
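
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below shows one way such a number could be computed. It assumes that "density" means the fraction of tokens with an activation above zero, which the page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 18 of which activate the feature.
sample = [0.0] * 99_982 + [1.0] * 18
density = activation_density(sample)
print(f"{density:.3%}")
```

With a density this low, the feature is silent on almost every token, which fits a feature tied to a narrow topic such as the sugar-related explanations listed above.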