INDEX
Explanations
numerical values or statistics related to events or entities
New Auto-Interp
Head Attr Weights
0:0.02
1:0.05
2:0.19
3:0.11
4:0.02
5:0.03
6:0.19
7:0.07
8:0.09
9:0.05
10:0.07
11:0.05
Negative Logits
spoiler
-1.40
crop
-1.32
覚醒
-1.27
icing
-1.26
ster
-1.26
giveaway
-1.19
flavour
-1.17
PDATE
-1.16
feather
-1.15
cont
-1.13
POSITIVE LOGITS
conservancy
1.81
bilt
1.70
cffff
1.68
zona
1.57
wx
1.55
zanne
1.46
zie
1.45
taboola
1.45
onte
1.43
ilet
1.42
Activations Density 0.027%