INDEX
Negative Logits
IgnoreCase
-0.16
teil
-0.15
rieb
-0.15
erb
-0.14
ibre
-0.14
rule
-0.13
eria
-0.13
issing
-0.13
ãĥ¼
-0.13
ấ
-0.13
POSITIVE LOGITS
suppress
0.19
s
0.16
lava
0.14
ision
0.14
chal
0.13
ácil
0.13
座
0.13
inded
0.13
pez
0.13
/.
0.13
Activations Density 0.004%