INDEX
Explanations
references to sports activities involving teamwork and competition
New Auto-Interp
Negative Logits
ersh
-0.14
DDS
-0.14
uche
-0.14
ingles
-0.14
#
-0.14
Peg
-0.14
Responsive
-0.14
indle
-0.14
eu
-0.14
pg
-0.14
POSITIVE LOGITS
Rugby
0.33
rugby
0.30
scr
0.28
hookers
0.26
rug
0.25
Barbar
0.23
try
0.23
rug
0.22
Test
0.22
RFC
0.22
Activations Density 0.082%