INDEX
Negative Logits
discover
-0.07
tricks
-0.07
perceive
-0.07
misunderstanding
-0.07
weapons
-0.06
Educational
-0.06
Enable
-0.06
developing
-0.06
>(&
-0.06
переход
-0.06
POSITIVE LOGITS
blogger
0.09
Blogger
0.09
blogging
0.08
bloggers
0.07
echn
0.07
_DH
0.06
mdb
0.06
ğ
0.06
oggle
0.06
äh
0.06
Activations Density 0.006%