INDEX
Negative Logits
Äįan
-0.16
ribbon
-0.15
ritten
-0.15
viron
-0.15
dden
-0.15
baugh
-0.15
phin
-0.15
IFn
-0.14
jours
-0.14
Registry
-0.14
POSITIVE LOGITS
ato
0.15
ês
0.14
wyn
0.14
NB
0.14
hawk
0.14
IES
0.14
eken
0.14
gap
0.14
.vars
0.14
engers
0.13
Activations Density 0.000%