INDEX
Negative Logits
+
0.79
also
0.70
Tw
0.66
/
0.66
Ad
0.64
&
0.64
Delhi
0.64
0.64
its
0.63
Di
0.63
POSITIVE LOGITS
㕧
0.85
effectuer
0.80
iremos
0.77
ܤ
0.77
गिवन
0.76
Ꮘ
0.76
verilen
0.75
Ꮡ
0.75
ologici
0.74
ANSAS
0.74
Activations Density 0.001%
+
also
Tw
/
Ad
&
Delhi
its
Di
㕧
effectuer
iremos
ܤ
गिवन
Ꮘ
verilen
Ꮡ
ologici
ANSAS