INDEX
Negative Logits
Kon
-0.07
abc
-0.07
presum
-0.06
pict
-0.06
causes
-0.06
Produk
-0.06
Telecom
-0.06
tzv
-0.06
Translate
-0.06
όγ
-0.06
POSITIVE LOGITS
idend
0.07
ends
0.07
loại
0.06
ド
0.06
または
0.06
madrid
0.06
ัจ
0.06
fol
0.06
intersection
0.06
underworld
0.06
Activations Density 0.008%