INDEX
Explanations
references to legal issues or copyright matters
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.12
3:0.14
4:0.23
5:0.04
6:0.15
7:0.02
8:0.04
9:0.08
10:0.06
11:0.03
Negative Logits
Twice
-1.31
Redd
-1.26
Restoration
-1.22
Slater
-1.21
waiver
-1.20
Admission
-1.18
Tomb
-1.18
Three
-1.16
Twenty
-1.16
Pur
-1.16
POSITIVE LOGITS
rael
1.58
izza
1.55
))))
1.53
andre
1.45
ui
1.41
�
1.39
oi
1.38
":{"1.38
)))
1.37
oba
1.35
Activations Density 0.006%