INDEX
Negative Logits
iverr
-0.09
appeared
-0.09
ehr
-0.08
-backed
-0.08
pledged
-0.08
Appe
-0.08
gbogbo
-0.08
েপ্ট
-0.08
ambled
-0.08
பாட
-0.08
POSITIVE LOGITS
which
0.08
".
0.07
destroying
0.07
तुल
0.07
Const
0.07
variety
0.07
.destroy
0.07
dua
0.07
جمهور
0.07
typelib
0.07
Activations Density 0.028%