INDEX
Negative Logits
pedia
-0.14
ively
-0.12
bilt
-0.12
ered
-0.12
اÙĨ
-0.11
paper
-0.11
ariat
-0.11
enment
-0.11
esel
-0.11
mente
-0.11
POSITIVE LOGITS
ming
0.44
mer
0.34
med
0.30
my
0.25
mers
0.24
MING
0.24
ms
0.18
mys
0.18
bers
0.18
me
0.17
Activations Density 0.062%