INDEX
Explanations
sentences that conclude with strong punctuation or statements
New Auto-Interp
Head Attr Weights
0:0.12
1:0.11
2:0.09
3:0.07
4:0.06
5:0.05
6:0.10
7:0.07
8:0.05
9:0.05
10:0.13
11:0.06
Negative Logits
MpServer
-3.65
Ur
-3.27
Harbaugh
-3.25
Arpaio
-3.07
Aval
-3.05
baugh
-3.03
XY
-3.00
Seg
-2.99
王
-2.97
ayers
-2.88
POSITIVE LOGITS
Natalie
6.30
atalie
4.30
Judd
3.60
nit
3.51
Nit
3.24
NAS
3.15
Nigel
2.97
Swan
2.81
reens
2.77
Nom
2.77
Activations Density 0.000%