INDEX
Explanations
passages discussing sports events or games
New Auto-Interp
Head Attr Weights
0:0.21
1:0.04
2:0.05
3:0.08
4:0.04
5:0.02
6:0.13
7:0.15
8:0.02
9:0.05
10:0.10
11:0.06
Negative Logits
TT
-2.99
GC
-2.93
rh
-2.87
Tibet
-2.81
Oro
-2.67
Polar
-2.65
>>>
-2.64
Py
-2.63
Bott
-2.63
Yellowstone
-2.57
POSITIVE LOGITS
Falcons
6.73
Fal
5.39
Julio
4.22
Atlanta
4.21
Atlanta
3.95
Brees
3.76
Buccaneers
3.60
Bucs
3.37
Seahawks
3.30
aneers
3.24
Activations Density 0.005%