INDEX
Explanations
matchups and scores from sports events
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.18
3:0.06
4:0.16
5:0.04
6:0.06
7:0.04
8:0.04
9:0.05
10:0.16
11:0.10
Negative Logits
expiration
-1.51
swer
-1.46
duplicate
-1.43
informing
-1.37
invention
-1.33
WHERE
-1.33
handwriting
-1.29
clues
-1.26
oscope
-1.26
tutorial
-1.26
POSITIVE LOGITS
pound
1.51
McMahon
1.48
ohyd
1.45
ongh
1.41
Anderson
1.38
urized
1.33
weights
1.32
Composite
1.31
Josh
1.28
Buff
1.28
Activations Density 0.002%