INDEX
Explanations
mentions of shapes or words related to shaping
mentions of the word "shape" in various contexts
New Auto-Interp
Negative Logits
artment
-0.72
amily
-0.71
govtrack
-0.71
Edited
-0.68
unts
-0.68
GS
-0.65
Mub
-0.65
BILITIES
-0.65
nea
-0.64
uers
-0.63
POSITIVE LOGITS
shape
1.04
shape
0.96
shif
0.88
cut
0.87
shapes
0.86
Shape
0.82
Shape
0.79
shaped
0.77
sheet
0.75
lier
0.73
Activations Density 0.021%