Negative Logits
uminate     -0.08
igth        -0.08
häufiger    -0.08
Jinping     -0.08
iktig       -0.07
الأكثر      -0.07
营          -0.07
arranged    -0.07
abundant    -0.07
Dish        -0.07
Positive Logits
unwanted       0.12
errone         0.10
pesky          0.10
craps          0.10
incorrectly    0.09
undes          0.09
inadvertently  0.09
bothers        0.08
ios            0.08
mistakenly     0.08
Activations Density: 0.012%