INDEX
Negative Logits
combe
-0.18
ful
-0.17
(es
-0.17
geber
-0.16
orem
-0.16
ustr
-0.16
.compat
-0.15
ym
-0.15
boro
-0.15
jos
-0.15
POSITIVE LOGITS
.au
0.49
.ua
0.31
.br
0.30
.mx
0.28
.bd
0.24
.cn
0.23
.tw
0.23
.ng
0.23
.cy
0.22
.ar
0.22
Activations Density 0.039%