INDEX
Negative Logits
ç©¿è¡£
-0.27
enter
-0.25
tact
-0.25
barric
-0.25
ives
-0.25
æĥĨ
-0.25
ä¿ĥ
-0.24
sout
-0.24
Lim
-0.24
è¿Ľåħ¥
-0.24
POSITIVE LOGITS
previously
0.32
quote
0.32
originally
0.28
had
0.27
weeney
0.27
already
0.26
äºĨä¸Ģåı¥
0.26
wouldn
0.26
Quote
0.25
æĴ·
0.25
Activations Density 0.014%