INDEX
Negative Logits
hire
-0.90
nd
-0.81
ouf
-0.79
lain
-0.76
roxy
-0.75
nda
-0.74
rup
-0.71
arij
-0.70
ritz
-0.70
dep
-0.68
POSITIVE LOGITS
ized
0.89
digits
0.84
igr
0.83
ised
0.81
ographic
0.80
oded
0.79
itized
0.79
ization
0.77
eteen
0.76
umeric
0.75
Activations Density 0.027%