INDEX
Explanations
terms related to shapes and physical forms
mentions of various types and forms of shapes
New Auto-Interp
Negative Logits
Mub
-0.84
onte
-0.76
unts
-0.74
aires
-0.70
amily
-0.65
Countdown
-0.61
arna
-0.61
Goodman
-0.61
roma
-0.61
ammy
-0.61
POSITIVE LOGITS
shif
0.97
shape
0.96
forms
0.89
shape
0.86
shapes
0.86
ly
0.83
Shape
0.82
liness
0.80
forming
0.77
sheet
0.77
Activations Density 0.038%