INDEX
Explanations
sports events where a team wins
instances of sports team victories
New Auto-Interp
Negative Logits
UX
-0.77
uum
-0.77
illary
-0.76
redits
-0.73
OCK
-0.72
igmat
-0.70
onge
-0.68
olutions
-0.66
INESS
-0.65
assian
-0.65
POSITIVE LOGITS
hang
0.80
clock
0.75
lord
0.75
rolet
0.72
hung
0.70
him
0.69
fellow
0.66
rist
0.66
whelming
0.64
Tara
0.63
Activations Density 0.019%