INDEX
Explanations
descriptions of actions in a sports match
New Auto-Interp
Negative Logits
Canaver
-0.86
curric
-0.83
indust
-0.82
netflix
-0.79
ersive
-0.79
chwitz
-0.78
enegger
-0.78
forums
-0.77
onym
-0.77
academia
-0.75
POSITIVE LOGITS
scorer
1.11
stopp
1.04
halftime
1.04
teammate
1.02
scored
1.01
score
1.01
tackle
1.00
scoring
0.99
foul
0.95
touchdown
0.93
Activations Density 1.917%