INDEX
Explanations
patterns of specific conjunctions and qualifiers in text
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.14
3:0.10
4:0.11
5:0.10
6:0.04
7:0.06
8:0.09
9:0.06
10:0.11
11:0.06
Negative Logits
itutes
-1.11
atives
-1.03
itution
-1.00
arettes
-0.99
lled
-0.99
Charges
-0.99
arez
-0.98
Kill
-0.98
zag
-0.95
Expend
-0.92
POSITIVE LOGITS
nonetheless
1.44
etheless
1.36
persisted
1.36
conclud
1.34
importantly
1.29
nevertheless
1.27
acknow
1.25
alas
1.24
beware
1.24
prevailed
1.22
Activations Density 0.090%