INDEX
Head Attr Weights
0:0.07
1:0.07
2:0.07
3:0.09
4:0.08
5:0.09
6:0.07
7:0.09
8:0.09
9:0.09
10:0.08
11:0.08
Negative Logits
Calories
-3.03
Reward
-3.03
Edison
-2.90
IPS
-2.89
Wikimedia
-2.83
EPS
-2.75
Redditor
-2.68
Alexis
-2.64
OPS
-2.63
Marcus
-2.63
POSITIVE LOGITS
hest
2.96
heres
2.80
latter
2.76
atch
2.67
fruitful
2.67
Born
2.50
pecially
2.49
ongo
2.39
itous
2.31
1987
2.31
Activations Density 0.000%