INDEX
Explanations
terms related to cutting and laser technology
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.06
3:0.05
4:0.04
5:0.05
6:0.27
7:0.13
8:0.04
9:0.05
10:0.10
11:0.10
Negative Logits
Sears
-1.33
pretext
-1.25
hift
-1.24
tricks
-1.22
alphabet
-1.19
Sauce
-1.18
lies
-1.17
unintended
-1.17
smells
-1.17
stride
-1.14
POSITIVE LOGITS
utor
1.69
onite
1.49
rison
1.37
gren
1.36
enced
1.36
idation
1.34
coni
1.33
cro
1.31
gradient
1.30
nih
1.30
Activations Density 0.001%