INDEX
Explanations
punctuated phrases indicating events or actions
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.15
3:0.10
4:0.12
5:0.04
6:0.03
7:0.14
8:0.10
9:0.03
10:0.08
11:0.13
Negative Logits
�士
-1.45
Guinness
-1.38
ortment
-1.38
viation
-1.35
Ambro
-1.34
Gul
-1.31
��
-1.31
Founding
-1.28
ワン
-1.26
Malt
-1.26
POSITIVE LOGITS
uberty
1.47
horizon
1.39
])
1.36
wont
1.32
])
1.25
ENGTH
1.22
})
1.20
insert
1.19
ework
1.17
))
1.16
Activations Density 0.026%