INDEX
Negative Logits
pard
-0.07
�
-0.07
increment
-0.06
ua
-0.06
arbitrarily
-0.06
Tu
-0.06
buz
-0.06
-letter
-0.06
kB
-0.06
'av
-0.06
POSITIVE LOGITS
sing
0.08
SENT
0.07
excerpt
0.06
charged
0.06
.hover
0.06
soccer
0.06
plays
0.06
gặp
0.06
craw
0.06
Overflow
0.06
Activations Density 0.000%