INDEX
Explanations
references to food, particularly ice cream and dessert items
Negative Logits
Token         Logit
frying        -0.17
cooking       -0.16
şar           -0.16
bread         -0.16
oise          -0.16
Marketable    -0.15
cooked        -0.15
underwater    -0.15
Vine          -0.15
bread         -0.15
Positive Logits
Token         Logit
ice           0.63
Ice           0.54
ice           0.52
Ice           0.48
ICE           0.45
冰            0.41
ICE           0.40
scoop         0.39
icy           0.35
sco           0.35
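Dashboards of this kind often display tokens in the raw vocabulary form of a byte-level BPE tokenizer, where non-ASCII text shows up as sequences such as `åĨ°` or `ÅŁar`. The sketch below shows one way to turn such strings back into readable text. It assumes a GPT-2-style byte-to-character table; the tokenizer behind this feature is not stated here, and the names `bytes_to_unicode` and `decode_token` are illustrative.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumed here; the actual tokenizer is not stated)."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(0x21, 0x7F))      # '!' .. '~'
          + list(range(0xA1, 0xAD))    # '¡' .. '¬'
          + list(range(0xAE, 0x100)))  # '®' .. 'ÿ'
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["åĨ°", "ÅŁar", "Ġice"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under this assumption, `åĨ°` decodes to 冰 (Chinese for "ice") and `ÅŁar` decodes to `şar`. A leading `Ġ` stands for a space, which would explain why entries such as `ice` and `Ice` can appear more than once in the same list.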
Activations Density 0.040%
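As a rough illustration of what a density figure like this can mean, the sketch below computes the fraction of sampled tokens on which a feature is active. This assumes density is defined as "share of tokens with activation above a threshold"; the dashboard's exact definition and sample are not given here, and `activation_density` is a hypothetical helper.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumed definition for illustration only; the dashboard may
    compute its density differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


if __name__ == "__main__":
    # Hypothetical sample: 4 active tokens out of 10,000.
    sample = [0.0] * 9996 + [1.5] * 4
    print(f"{activation_density(sample):.3%}")
```

With this made-up sample the feature fires on 4 of 10,000 tokens, which is the same order of sparsity as the value reported above.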