INDEX
Explanations
instances of specific people or entities involved in sports scandals or operations
New Auto-Interp
Negative Logits
Both
-0.22
both
-0.21
BOTH
-0.20
Both
-0.18
ernet
-0.18
annon
-0.17
两人
-0.17
both
-0.17
ambos
-0.15
ro
-0.14
POSITIVE LOGITS
these
0.28
è¿ĻäºĽ
0.28
these
0.28
above
0.25
Above
0.25
none
0.24
above
0.22
These
0.22
THESE
0.22
ABOVE
0.22
Activations Density 0.235%