INDEX
Head Attr Weights
0:0.09
1:0.08
2:0.06
3:0.08
4:0.08
5:0.08
6:0.08
7:0.08
8:0.08
9:0.08
10:0.07
11:0.08
Negative Logits
Rove
-2.24
downgrade
-2.09
Models
-2.08
Reloaded
-2.07
rob
-2.06
Beam
-2.03
refurb
-2.01
BB
-2.00
RP
-2.00
ulatory
-1.99
POSITIVE LOGITS
essen
2.22
luck
2.15
EGIN
2.13
ENT
2.11
English
2.09
ENTS
2.09
GOODMAN
2.09
anx
2.02
France
2.00
ée
1.95
Activations Density 0.000%