INDEX
Explanations
references to culinary experiences and cooking
Negative Logits
Token      Logit
Coffee     -0.17
Coffee     -0.16
caffe      -0.16
adultes    -0.15
thirsty    -0.15
tobacco    -0.15
fiz        -0.14
caffe      -0.14
Cotton     -0.14
βά         -0.14
Positive Logits
Token      Logit
cooking    0.45
cooks      0.42
kitchens   0.42
kitchen    0.41
chefs      0.41
chef       0.40
cook       0.40
Cooking    0.39
culinary   0.38
Cook       0.38
Activations Density 0.573%