INDEX
Negative Logits
Token      Logit
or         -0.08
rounded    -0.08
-0.08      (token missing in source)
within     -0.08
24         -0.08
content    -0.08
.          -0.07
या         -0.07
ya         -0.07
();        -0.07
Positive Logits
Token      Logit
relação    0.09
wikipedia  0.09
ちなみに   0.09
welk       0.09
Congrats   0.09
Wifi       0.09
Walmart    0.08
hilarious  0.08
przec      0.08
nade       0.08
Activations Density: 0.011%