INDEX
Explanations
prominent names and references related to sports and entertainment
Head Attr Weights
0:0.02
1:0.01
2:0.03
3:0.05
4:0.04
5:0.03
6:0.41
7:0.14
8:0.05
9:0.07
10:0.06
11:0.04
Negative Logits
Shape: -1.15
EMP: -1.15
diapers: -1.15
peninsula: -1.14
IUM: -1.14
leagues: -1.11
sterile: -1.10
GMT: -1.10
Applic: -1.09
ECA: -1.09
Positive Logits
aughs: 1.58
amins: 1.52
arching: 1.48
zen: 1.46
zon: 1.44
ogun: 1.42
govtrack: 1.33
arez: 1.31
backer: 1.30
kamp: 1.29
Activations Density 0.003%