Explanations
references to chili dishes
Negative Logits
*/(         -0.88
sonian      -0.87
roup        -0.82
cffff       -0.78
Reviewer    -0.77
deen        -0.76
allery      -0.75
Democr      -0.73
lie         -0.72
¶æ          -0.72
Positive Logits
peppers     1.36
pepper      1.13
chili       1.08
powder      0.97
stove       0.94
iso         0.94
garlic      0.93
spicy       0.91
flakes      0.90
bean        0.90
Activations Density: 0.005%