INDEX
Explanations
mentions of sports teams and their performance
New Auto-Interp
Negative Logits
utterstock
-0.15
polator
-0.15
Derrick
-0.15
à¥įतà¤ķ
-0.15
trinsic
-0.14
utzer
-0.14
inds
-0.14
Hampton
-0.14
лаж
-0.14
ottle
-0.13
POSITIVE LOGITS
elop
0.17
ADOS
0.15
ITA
0.14
Rip
0.14
Forces
0.14
icol
0.14
ripp
0.14
atron
0.14
ripple
0.14
θν
0.14
Activations Density 0.046%