INDEX
Explanations
references to sporting events and scores
New Auto-Interp
Negative Logits
01
-0.15
alle
-0.15
eÅŁ
-0.15
409
-0.14
Centro
-0.14
éĺ
-0.14
Base
-0.14
.
-0.14
liv
-0.14
jes
-0.14
POSITIVE LOGITS
rout
0.28
easy
0.23
easy
0.21
landslide
0.21
ease
0.21
runaway
0.20
cruise
0.20
Cruise
0.20
Easy
0.19
overwhelming
0.19
Activations Density 0.183%