INDEX
Explanations
the word "honey" with various contexts and forms, such as honey moon, honey pot, or simply as a term of endearment
references to honey
New Auto-Interp
Negative Logits
aneous
-0.85
aneously
-0.84
ATIONS
-0.77
uers
-0.73
igion
-0.72
================================================================
-0.72
osure
-0.71
ative
-0.70
ACTED
-0.70
andals
-0.70
POSITIVE LOGITS
moon
1.24
bee
1.12
bee
1.07
bees
1.06
honey
1.04
Honey
0.99
comb
0.98
bees
0.94
Bee
0.91
bean
0.89
Activations Density 0.005%