INDEX
Explanations
sentences related to economic issues and legal consequences
New Auto-Interp
Head Attr Weights
0:0.10
1:0.02
2:0.11
3:0.12
4:0.04
5:0.15
6:0.05
7:0.03
8:0.14
9:0.08
10:0.10
11:0.03
Negative Logits
spection
-1.29
acknowled
-1.28
OPLE
-1.22
alliances
-1.20
exceptions
-1.20
hindsight
-1.20
ework
-1.18
Strategies
-1.16
assumptions
-1.16
RELEASE
-1.14
POSITIVE LOGITS
onto
1.30
ogie
1.17
Built
1.10
rightfully
1.10
cu
1.09
Located
1.06
greeted
1.05
AVG
1.04
welcomed
1.01
fitted
0.99
Activations Density 0.727%