INDEX
Explanations
references to salads and healthy eating
New Auto-Interp
Negative Logits
Bake
-0.18
dough
-0.17
baker
-0.17
illac
-0.17
bake
-0.16
.gs
-0.16
baking
-0.16
-cookie
-0.16
ãĤ¤ãĤ¯
-0.16
lav
-0.15
POSITIVE LOGITS
dressing
0.31
salad
0.30
Salad
0.27
Dress
0.26
salads
0.26
iceberg
0.25
tossed
0.24
toss
0.24
dress
0.23
tossing
0.20
Activations Density 0.041%