INDEX
Negative Logits
Charge
0.63
CHAP
0.62
fire
0.61
voluntarily
0.61
ributive
0.60
kowitz
0.60
charge
0.60
charge
0.59
Equality
0.59
riline
0.59
POSITIVE LOGITS
makes
0.56
Makes
0.56
मोबाईल
0.54
aquellas
0.54
wanted
0.53
अव
0.52
ቢ
0.51
би
0.51
Performance
0.51
煨
0.50
Activations Density 0.137%