INDEX
Explanations
interactions involving communication and insurance-related topics
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.08
3:0.13
4:0.06
5:0.04
6:0.02
7:0.04
8:0.06
9:0.09
10:0.23
11:0.16
Negative Logits
historically
-1.28
ensemble
-1.26
efeated
-1.25
zbollah
-1.11
inctions
-1.10
opoly
-1.09
intrinsically
-1.05
sustainability
-1.04
disproportionately
-1.04
Adapt
-1.02
POSITIVE LOGITS
upon
1.38
promptly
1.36
then
1.33
IRC
1.31
SPONSORED
1.31
Phone
1.27
Fill
1.18
REDACTED
1.17
mop
1.12
immediately
1.12
Activations Density 1.110%