INDEX
Explanations
references to significant ceremonial or political events
New Auto-Interp
Head Attr Weights
0:0.05
1:0.38
2:0.04
3:0.04
4:0.03
5:0.13
6:0.05
7:0.02
8:0.07
9:0.05
10:0.05
11:0.04
Negative Logits
uers
-1.99
SEC
-1.86
VERS
-1.70
Northwestern
-1.69
Reuters
-1.68
Tribune
-1.64
izont
-1.64
seys
-1.64
Republic
-1.63
ット
-1.61
POSITIVE LOGITS
ben
2.33
Benz
2.11
Be
2.04
enance
1.98
obe
1.97
Be
1.95
Bee
1.95
laun
1.91
bered
1.89
eb
1.86
Activations Density 0.002%