INDEX
Negative Logits
zelf
-0.07
Katy
-0.06
k�
-0.06
STDMETHOD
-0.06
*/↵↵
-0.06
osloven
-0.06
sağlık
-0.06
ché
-0.06
_added
-0.06
دول
-0.06
POSITIVE LOGITS
neighboring
0.07
Filter
0.06
(chain
0.06
abused
0.06
mills
0.06
-provider
0.06
church
0.06
-append
0.06
mom
0.06
jeta
0.06
Activations Density 0.003%