INDEX
Explanations
references to sections and summaries in written content
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.07
4:0.09
5:0.03
6:0.05
7:0.33
8:0.05
9:0.03
10:0.07
11:0.14
Negative Logits
excuse
-1.42
Canaver
-1.30
893
-1.29
ヘ
-1.25
erest
-1.21
refrain
-1.21
rone
-1.21
Meredith
-1.19
Hib
-1.19
911
-1.18
POSITIVE LOGITS
Installation
1.67
snipp
1.58
chromos
1.56
NetMessage
1.52
tions
1.51
ilities
1.51
�
1.48
puter
1.43
Flavoring
1.39
successful
1.36
Activations Density 0.002%