INDEX
Explanations
food-related words, particularly those related to pasta and noodles
references to pasta and noodle dishes
New Auto-Interp
Negative Logits
Tall
-0.77
Hamp
-0.73
Ft
-0.72
izons
-0.71
Austin
-0.71
sburg
-0.70
psy
-0.68
awks
-0.68
^^^^
-0.67
Houston
-0.67
POSITIVE LOGITS
noodles
1.22
pasta
1.13
nood
1.12
spaghetti
0.97
arella
0.97
slic
0.89
soup
0.89
cake
0.85
sauce
0.84
olini
0.83
Activations Density 0.008%