INDEX
Negative Logits
perme
-0.16
sn
-0.15
wap
-0.14
Starr
-0.14
ä»
-0.14
Ìī
-0.14
fin
-0.14
cx
-0.14
quil
-0.14
super
-0.13
POSITIVE LOGITS
mps
0.17
idla
0.16
bil
0.15
).__
0.15
elper
0.15
Ortiz
0.15
isoft
0.14
owitz
0.14
uby
0.14
chner
0.14
Activations Density 0.003%