INDEX
Explanations
descriptions or mentions of shapes with circular attributes
the concept of "circle" as it is mentioned frequently in various contexts
New Auto-Interp
Negative Logits
Bundes
-0.74
ãĥ¤
-0.71
ractive
-0.70
ensional
-0.66
iary
-0.64
icating
-0.64
ior
-0.63
inse
-0.63
owntown
-0.63
iciency
-0.62
POSITIVE LOGITS
circle
1.02
naire
0.91
Circle
0.86
circle
0.84
jerk
0.81
jer
0.80
wheel
0.80
cules
0.80
circles
0.79
naires
0.77
Activations Density 0.008%