INDEX
Explanations
references to sports teams and competitions
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.04
4:0.07
5:0.03
6:0.39
7:0.09
8:0.02
9:0.03
10:0.10
11:0.08
Negative Logits
etsk
-1.30
omission
-1.27
hus
-1.26
inherent
-1.19
usive
-1.19
Fargo
-1.16
assetsadobe
-1.15
Jones
-1.11
lycer
-1.11
chore
-1.10
POSITIVE LOGITS
lished
1.45
thur
1.41
ntil
1.38
��
1.34
merce
1.34
Telescope
1.31
States
1.31
ainer
1.29
aroo
1.26
Regions
1.25
Activations Density 0.003%