INDEX
Explanations
mentions of sporting events and associated criticisms
New Auto-Interp
Negative Logits
rumor
-0.23
rumors
-0.23
favor
-0.22
rumored
-0.21
favors
-0.20
neighboring
-0.20
unfavor
-0.20
flavors
-0.20
favorable
-0.19
Defense
-0.18
POSITIVE LOGITS
-
0.20
United
0.20
0.17
Six
0.16
--
0.16
overload
0.16
City
0.15
https
0.15
amid
0.15
title
0.15
Activations Density 0.134%