Explanations
references to writing improvement and techniques
Head Attr Weights
0:0.37
1:0.02
2:0.14
3:0.09
4:0.02
5:0.08
6:0.03
7:0.04
8:0.04
9:0.05
10:0.03
11:0.03
Negative Logits
Chaff      -2.96
CCTV       -2.87
ramid      -2.62
wcsstore   -2.58
��         -2.55
Mub        -2.51
aband      -2.51
chwitz     -2.50
ndum       -2.44
lyak       -2.41
Positive Logits
writing      6.29
writers      6.17
Writers      5.87
writing      5.82
Writ         5.74
Writing      5.70
literary     5.66
writer       5.66
Writing      5.59
manuscript   5.50
Activations Density 0.675%