INDEX
Negative Logits
diet
-0.06
oru
-0.06
yapı
-0.06
Came
-0.06
scarc
-0.06
useClass
-0.06
ῶν
-0.06
dw
-0.06
-----------
-0.06
Sang
-0.05
POSITIVE LOGITS
RAIN
0.07
comprehend
0.07
Posting
0.07
फ
0.07
(MainActivity
0.06
passing
0.06
úmero
0.06
turn
0.06
afternoon
0.06
art
0.06
Activations Density 0.002%