INDEX
Explanations
food-related terms, specifically focused on burgers
references to the term "burger" or related variations
New Auto-Interp
Negative Logits
Sv
-0.77
ast
-0.76
Sri
-0.75
Circ
-0.74
Ach
-0.72
SY
-0.71
Ae
-0.70
Tam
-0.69
Az
-0.68
Nev
-0.67
POSITIVE LOGITS
burger
3.51
burgers
3.19
Burger
3.10
hamb
2.48
fries
1.97
steak
1.92
urger
1.65
taco
1.62
tacos
1.59
Grill
1.59
Activations Density 0.021%