Explanations
fought against
played against
Negative Logits
oyunc       0.40
Repeat      0.40
Debye       0.39
Spiel       0.38
ところに     0.38
etwas       0.36
carbonyl    0.35
வினை        0.35
けん         0.35
চালু         0.34
Positive Logits
against          1.02
against          0.93
против           0.80
Against          0.80
Against          0.77
AGAINST          0.76
gegen            0.70
tegen            0.68
competitively    0.68
alongside        0.66
Activations Density: 0.010%
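
The density figure is usually read as the fraction of token positions on which the feature fires at all. A minimal Python sketch of that reading follows; the function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration, not values or code taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens with a nonzero activation.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 token out of 10,000.
acts = [0.0] * 9999 + [3.2]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.010%
```

Under this reading, a density of 0.010% corresponds to roughly one active token in every 10,000, which fits a feature that responds to a narrow family of tokens such as "against" and its translations.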