INDEX
Explanations
references to pie
instances of the word "pie."
New Auto-Interp
Negative Logits
spring
-0.80
iveness
-0.75
Students
-0.73
EDITION
-0.70
runners
-0.70
Examiner
-0.70
nen
-0.68
nesses
-0.67
sers
-0.66
iqueness
-0.64
POSITIVE LOGITS
pie
1.30
pies
1.24
pie
1.09
Pie
0.98
Pie
0.96
crust
0.91
desserts
0.89
artisan
0.83
ced
0.82
dough
0.80
Activations Density 0.013%