INDEX
Explanations
negations or qualifiers indicating doubt or uncertainty
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.06
4:0.06
5:0.03
6:0.15
7:0.40
8:0.02
9:0.03
10:0.04
11:0.06
Negative Logits
orders
-1.62
deliveries
-1.60
unia
-1.57
Orders
-1.44
覚醒
-1.43
Purchase
-1.35
along
-1.35
carts
-1.35
Blossom
-1.34
Lear
-1.33
POSITIVE LOGITS
pose
1.57
iour
1.40
hed
1.37
Picture
1.36
imensional
1.33
dent
1.32
IPM
1.30
CVE
1.30
polit
1.29
olars
1.28
Activations Density 0.073%