INDEX
Negative Logits
Hood
-0.15
Shannon
-0.15
urus
-0.15
ÑĪел
-0.15
usercontent
-0.14
uhl
-0.14
uffman
-0.14
fro
-0.13
sk
-0.13
8
-0.13
POSITIVE LOGITS
azi
0.20
ubat
0.19
ormsg
0.18
oca
0.16
aders
0.16
Tort
0.16
tort
0.15
zac
0.15
eyJ
0.15
pent
0.15
Activations Density 0.019%