INDEX
Explanations
phrases indicating quantity or reference to unspecified groups
Negative Logits
kasarigan  -0.80
AddHtmlAttribute  -0.75
يتيمه  -0.72
Réponses  -0.72
ParallelGroup  -0.69
Vidite  -0.65
httphttps  -0.65
CURIAM  -0.62
Smarty  -0.61
цездатний  -0.60
Positive Logits
jmniej  0.76
zumindest  0.69
setidaknya  0.68
almeno  0.66
atleast  0.64
wenigstens  0.63
فريبيس  0.55
minimum  0.55
至少  0.55
حداقل  0.55
Activations Density 0.123%