INDEX
Explanations
mentions of popular food items or dishes
New Auto-Interp
Negative Logits
Garland
-0.17
adro
-0.15
holm
-0.15
dul
-0.15
Kok
-0.14
pij
-0.14
rael
-0.14
ofilm
-0.14
Hubbard
-0.14
colum
-0.14
POSITIVE LOGITS
burger
0.43
burgers
0.40
Burg
0.40
bun
0.37
Burger
0.36
burg
0.35
patt
0.35
burger
0.34
hamburg
0.32
hamburger
0.31
Activations Density 0.044%