INDEX
NEGATIVE LOGITS
household      -0.71
عربي           -0.68
والاست         -0.68
Slovakia       -0.67
lupa           -0.65
proportionate  -0.64
Palmerston     -0.63
生まれ         -0.63
chat           -0.63
packaging      -0.63
POSITIVE LOGITS
hibli          0.87
Supplemental   0.69
图书馆         0.65
Supplemental   0.64
mnop           0.63
libs           0.63
λασ            0.63
⎬              0.63
まります       0.62
ancier         0.62
Activations Density 0.137%