INDEX
Negative Logits
不了
-0.08
baze
-0.08
dilution
-0.07
_SB
-0.07
sau
-0.07
حت
-0.07
franch
-0.07
fire
-0.07
diluted
-0.07
Wizard
-0.07
POSITIVE LOGITS
flipping
0.09
flips
0.08
ров
0.08
flipped
0.08
Lenn
0.08
ǎ
0.07
ífico
0.07
driver
0.07
proprietário
0.07
seis
0.07
Activations Density 0.007%