INDEX
Explanations
references to the food item "toast" and "gravy"
mentions of food items, particularly toast
New Auto-Interp
Negative Logits
crossings
-0.71
riages
-0.66
ologne
-0.64
ities
-0.63
andro
-0.62
chy
-0.61
ockey
-0.61
drawn
-0.59
pent
-0.58
Parenthood
-0.57
POSITIVE LOGITS
toast
1.69
Toast
1.53
antioxid
0.94
masters
0.94
ï¸
0.91
popcorn
0.86
cakes
0.84
alore
0.83
affles
0.81
emoji
0.80
Activations Density 0.004%