INDEX
Negative Logits
ba
-0.08
Had
-0.08
/key
-0.08
wan
-0.08
Waik
-0.08
/be
-0.07
동안
-0.07
-0.07
_prior
-0.07
ka
-0.07
POSITIVE LOGITS
ுடைய
0.09
δυ
0.08
homen
0.08
pinnacle
0.08
ieving
0.08
ratt
0.07
ಳಿ
0.07
dita
0.07
Ron
0.07
aven
0.07
Activations Density 0.005%