INDEX
Explanations
design principles related to text formatting and presentation
New Auto-Interp
Head Attr Weights
0:0.12
1:0.08
2:0.08
3:0.04
4:0.05
5:0.02
6:0.13
7:0.17
8:0.02
9:0.03
10:0.05
11:0.16
Negative Logits
Sheldon
-2.69
romeda
-2.43
Probe
-2.35
Noon
-2.33
Sands
-2.33
Shuttle
-2.30
Frontier
-2.29
ederation
-2.22
preparations
-2.19
Crack
-2.17
POSITIVE LOGITS
attribute
4.70
attributes
4.62
Attribute
4.16
attr
3.94
attribute
3.89
Attributes
3.82
Attributes
3.75
att
2.74
assigns
2.70
descriptor
2.64
Activations Density 0.009%