INDEX
Negative Logits
anny
-0.17
ongo
-0.15
ante
-0.15
cola
-0.15
ave
-0.14
Spin
-0.14
arte
-0.14
esi
-0.14
wich
-0.14
ako
-0.14
POSITIVE LOGITS
Umb
0.17
اسب
0.16
ÑĪки
0.15
ething
0.14
æ£
0.14
ockets
0.14
yssey
0.14
erif
0.14
dictions
0.14
lict
0.13
Activations Density 0.014%