INDEX
Negative Logits
bout
-0.28
substitute
-0.27
ople
-0.26
obel
-0.25
obby
-0.25
Voice
-0.25
substitutes
-0.25
åĮº
-0.24
title
-0.24
Veterans
-0.24
POSITIVE LOGITS
arpa
0.28
illes
0.26
éĨĽ
0.26
æŃ§è§Ĩ
0.25
rens
0.25
æ²¥éĿĴ
0.25
rale
0.25
pháp
0.24
个çϾåĪĨ
0.24
ALLOW
0.24
Activations Density 0.541%