INDEX
Explanations
punctuation and structural elements within text
Head Attr Weights
0:0.03
1:0.02
2:0.08
3:0.07
4:0.05
5:0.04
6:0.35
7:0.05
8:0.03
9:0.07
10:0.09
11:0.07
Negative Logits
Tycoon    -1.61
ashtra    -1.50
ufact     -1.49
STER      -1.45
Kal       -1.42
inen      -1.41
¯         -1.37
inki      -1.36
loo       -1.35
emouth    -1.35
Positive Logits
cc        1.57
ascript   1.38
Refresh   1.33
Spread    1.33
catentry  1.28
pp        1.17
speakers  1.16
speech    1.15
Entry     1.15
tick      1.13
Activations Density 0.001%