INDEX
Negative Logits
Buk
-0.08
bother
-0.07
on
-0.07
ूँ
-0.06
Ok
-0.06
unary
-0.06
ONENT
-0.06
Disconnected
-0.06
msgs
-0.06
qos
-0.06
POSITIVE LOGITS
目
0.06
sect
0.06
salt
0.06
classified
0.06
št
0.06
sculpt
0.06
bicycle
0.06
ูแล
0.06
characteristics
0.06
-commerce
0.06
Activations Density 0.003%