INDEX
Explanations
actions and intentions related to sports teams and their performance
New Auto-Interp
Negative Logits
еÑĢо
-0.15
ucc
-0.15
804
-0.15
zwarte
-0.14
OLS
-0.14
dove
-0.14
colored
-0.14
oste
-0.14
elize
-0.14
loo
-0.14
POSITIVE LOGITS
fancy
0.21
pip
0.19
gate
0.17
capital
0.17
feature
0.17
blood
0.17
troubling
0.16
rubber
0.16
sav
0.16
pit
0.15
Activations Density 0.059%