INDEX
Explanations
plays and specific outcomes of sports events
Negative Logits
pel             -0.17
anka            -0.15
utory           -0.14
Kaynak          -0.14
egas            -0.14
istant          -0.14
AXB             -0.13
erot            -0.13
XCTAssertTrue   -0.13
odate           -0.13
Positive Logits
runners         0.29
bases           0.24
runner          0.23
runner          0.21
bases           0.20
Runner          0.19
bas             0.19
Runner          0.18
-runner         0.18
inherited       0.17
Activations Density 0.025%