INDEX
Explanations
phrases or references to competitive sports and their impact on individuals
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.12
3:0.35
4:0.10
5:0.05
6:0.02
7:0.04
8:0.07
9:0.08
10:0.07
11:0.04
Negative Logits
)?
-1.90
pires
-1.85
"?
-1.80
ocaust
-1.77
?".
-1.74
chwitz
-1.69
=================================
-1.57
Reviewer
-1.56
uncture
-1.55
oğan
-1.55
POSITIVE LOGITS
yesterday
1.67
NT
1.65
Ples
1.51
waivers
1.41
Higgins
1.33
intending
1.32
RH
1.29
Brach
1.28
Hyundai
1.28
whilst
1.28
Activations Density 1.161%