INDEX
Explanations
words related to food and dining experiences
New Auto-Interp
Negative Logits
Sher
-0.17
acock
-0.15
ertools
-0.14
ượu
-0.14
bows
-0.14
iola
-0.14
Sher
-0.14
-mf
-0.14
íĻķ
-0.14
pel
-0.13
POSITIVE LOGITS
bun
0.33
sandwich
0.32
SAND
0.30
sand
0.30
Sand
0.29
sand
0.29
sandwiches
0.27
Sandwich
0.27
slider
0.25
burger
0.25
Activations Density 0.071%