INDEX
Negative Logits
kone
-0.08
keen
-0.08
Hel
-0.08
shield
-0.07
Received
-0.07
keď
-0.07
いい
-0.07
mouseup
-0.07
lur
-0.07
יצה
-0.07
POSITIVE LOGITS
वार
0.08
Diagram
0.08
वारी
0.08
diagram
0.08
વાર
0.08
introductory
0.08
Sugar
0.08
redesigned
0.08
Hof
0.08
bem
0.07
Activations Density 0.006%