Head Attr Weights
0:0.02
1:0.01
2:0.04
3:0.04
4:0.03
5:0.03
6:0.34
7:0.28
8:0.04
9:0.04
10:0.05
11:0.04
Negative Logits
prosecut:-1.55
ウス:-1.35
▬:-1.35
ォ:-1.32
Choice:-1.32
cluding:-1.32
cruel:-1.31
tions:-1.27
ギ:-1.27
ATIONS:-1.25
Positive Logits
raviolet:1.41
beh:1.39
geon:1.35
anu:1.34
origin:1.34
ohan:1.34
Pac:1.32
Apache:1.31
stream:1.31
Enterprise:1.30
Activations Density 0.000%