INDEX
Explanations
mentions of various organizations and events related to sports, labor, and social issues
New Auto-Interp
Negative Logits
rangle
-0.15
udge
-0.14
iox
-0.14
regular
-0.14
::_
-0.13
erve
-0.13
tro
-0.13
ienie
-0.13
comb
-0.13
-wise
-0.13
POSITIVE LOGITS
,#
0.24
#
0.22
hashtag
0.20
#w
0.20
/#
0.20
#g
0.18
#ad
0.17
|#
0.17
#af
0.16
#c
0.16
Activations Density 0.032%