INDEX
Explanations
mentions of the word "honey" with varying levels of relevance
references to honey and its various forms or contexts
New Auto-Interp
Negative Logits
ŃĶ
-0.78
ative
-0.78
osure
-0.77
uers
-0.75
ATIONS
-0.70
istical
-0.69
aneous
-0.69
ista
-0.66
uing
-0.66
Rite
-0.65
POSITIVE LOGITS
moon
1.48
bees
1.24
comb
1.19
bee
1.14
pots
1.00
pot
0.95
bean
0.94
cot
0.94
beans
0.93
oney
0.91
Activations Density 0.035%