Explanations
mentions of bowls
mentions of bowls in various contexts
Negative Logits
Token       Logit
eanor       -0.65
nesota      -0.60
Anarchy     -0.59
Domin       -0.59
akin        -0.59
selves      -0.58
arresting   -0.58
Anton       -0.57
apse        -0.57
Agency      -0.57
Positive Logits
Token       Logit
bowl        1.15
bowl        1.12
bowls       0.98
hend        0.93
pipe        0.93
cup         0.89
Bowl        0.88
halla       0.79
stein       0.76
š           0.75
Activations Density: 0.007%