INDEX
Explanations
references to sports teams and their performances
New Auto-Interp
Negative Logits
robat
-0.17
.sav
-0.17
thái
-0.15
Shank
-0.15
ignet
-0.15
successfully
-0.15
åĨł
-0.14
gnore
-0.14
IMA
-0.14
زاÙĨ
-0.14
POSITIVE LOGITS
fal
0.32
struggle
0.30
struggles
0.28
succ
0.27
lim
0.27
fail
0.24
struggled
0.24
suffer
0.23
struggling
0.23
lost
0.23
Activations Density 0.117%