INDEX
Explanations
mentions of specific locations or teams in sports contexts
New Auto-Interp
Negative Logits
baru
-0.07
erdem
-0.07
ifestyles
-0.06
ore
-0.06
ANJI
-0.06
ÙħÙĤاÙħ
-0.06
hots
-0.05
apixel
-0.05
.Std
-0.05
acter
-0.05
POSITIVE LOGITS
team
0.11
Team
0.10
team
0.10
Team
0.09
ãĥģãĥ¼ãĥł
0.08
Group
0.08
_team
0.08
-team
0.08
ãĥ¼ãĥģ
0.08
íĮĢ
0.07
Activations Density 0.018%