INDEX
Negative Logits
lec
-0.18
ones
-0.16
òng
-0.15
ÑĥÑĢг
-0.15
irit
-0.14
lector
-0.14
ency
-0.14
oder
-0.13
tlement
-0.13
575
-0.13
POSITIVE LOGITS
phans
0.17
usi
0.16
nown
0.16
ally
0.15
thon
0.14
sost
0.14
medi
0.14
naments
0.14
ebin
0.14
mdir
0.13
Activations Density 0.221%