INDEX
Explanations
mentions of various types of potatoes
references to potatoes or potato-related dishes
Negative Logits
Token      Logit
anwhile    -0.85
olulu      -0.83
¥ŀ         -0.78
eanor      -0.77
stract     -0.74
student    -0.74
inances    -0.73
uilt       -0.73
¥µ         -0.72
issued     -0.72

Positive Logits
Token      Logit
potato     1.06
starch     1.05
potatoes   1.01
atoes      0.97
Potato     0.94
weed       0.93
cake       0.91
salad      0.91
chips      0.90
cris       0.88
Activations Density: 0.011%