INDEX
Explanations
references to the "Red" teams or entities, particularly in sports contexts
New Auto-Interp
Negative Logits
ILA
-0.73
merce
-0.73
awaru
-0.71
Ö¼
-0.71
SPONSORED
-0.71
4090
-0.68
FAULT
-0.68
incorpor
-0.68
ilities
-0.67
Lank
-0.65
POSITIVE LOGITS
eem
1.25
ucing
1.24
uces
1.18
uced
1.14
ucer
1.10
emption
1.09
irect
1.09
oubt
1.09
cliffe
1.06
neck
1.05
Activations Density 0.019%