INDEX
NEGATIVE LOGITS
wett          -0.07
quant         -0.07
bar           -0.07
lidi          -0.07
tedy          -0.07
nice          -0.07
motiv         -0.07
midfielder    -0.07
turbines      -0.07
collections   -0.07
POSITIVE LOGITS
بعضها         0.10
correcto      0.09
ایه           0.09
诈骗          0.09
disguised     0.09
myths         0.08
გამოს         0.08
deceive       0.08
flawed        0.08
erroneous     0.08
Activations Density 0.015%