INDEX
Explanations
mentions of the word "play" in the context of sports or performance
references to specific plays in sports contexts
New Auto-Interp
Negative Logits
«ĺ
-0.76
Apost
-0.69
unden
-0.67
Beir
-0.65
Unic
-0.64
++++++++++++++++
-0.64
exclusive
-0.63
pora
-0.62
orpor
-0.62
captivity
-0.62
POSITIVE LOGITS
ername
1.35
wright
1.27
maker
1.22
plays
1.08
makers
1.06
styles
0.99
making
0.99
wr
0.99
tes
0.98
testers
0.96
Activations Density 0.041%