Explanations
mentions of "honey" in the context of food or drink recipes
Negative Logits
osure      -0.79
ative      -0.76
ŃĶ         -0.73
ATIONS     -0.73
uers       -0.72
aneous     -0.72
WATCHED    -0.69
ngth       -0.67
istical    -0.67
Rite       -0.66
Positive Logits
moon       1.48
bees       1.24
comb       1.16
bee        1.14
pots       1.00
bean       0.98
cot        0.97
bee        0.97
pot        0.97
beans      0.95
Activations Density: 0.023%