Explanations
opponent, adversary, rivalry
Negative Logits
comando     0.47
flusso      0.44
pedals      0.44
控制        0.43
sostegno    0.43
fluidity    0.41
ângulo      0.40
cabo        0.40
elegantly   0.40
reintegr    0.39
Positive Logits
对手        0.76
相手        0.74
opponent    0.72
adversary   0.70
对方        0.67
opponent    0.66
상대        0.64
對方        0.62
opponents   0.61
matchup     0.58
Activations Density: 0.142%