INDEX
Explanations
references to pie, both as a food item and in metaphorical expressions
New Auto-Interp
Negative Logits
é
-0.17
ship
-0.16
naments
-0.15
ump
-0.15
ooter
-0.15
astle
-0.15
grind
-0.14
robe
-0.14
dess
-0.14
ÑĥÑģк
-0.14
POSITIVE LOGITS
_PATCH
0.16
bero
0.15
Odd
0.15
attery
0.15
ynes
0.14
ì¤ij
0.14
ovi
0.14
ovaly
0.14
elt
0.14
_patch
0.14
Activations Density 0.006%