Explanations
phrases indicating sports team performance and statistics
Negative Logits
Token       Logit
attering    -0.15
ucas        -0.15
oba         -0.14
ยน          -0.14
erson       -0.14
ypad        -0.14
015         -0.14
atee        -0.14
ropoda      -0.13
必要        -0.13
Positive Logits
Token       Logit
favorites   0.19
favourites  0.17
without     0.17
neck        0.16
hoping      0.16
coast       0.16
favored     0.16
fifth       0.16
down        0.15
playing     0.15
Activations Density: 0.058%