INDEX
Explanations
narratives about sports events and results
Negative Logits
teammate  -0.15
olumn     -0.15
ornings   -0.15
prung     -0.14
agli      -0.14
лиж       -0.14
.sg       -0.14
alleng    -0.14
iger      -0.14
ording    -0.14
Positive Logits
between  0.20
evenly   0.18
between  0.18
BETWEEN  0.17
Between  0.17
uds      0.17
dynamic  0.17
dy       0.16
Between  0.16
vetica   0.16
Activations Density 0.341%