INDEX
Explanations
references to governmental authority and actions
Negative Logits
kasarigan          -0.82
httphttps          -0.79
ագրություններ      -0.77
Дереккөздер        -0.76
?>/                -0.76
twimg              -0.75
Ligações           -0.66
mergeFrom          -0.66
beginnetje         -0.65
PreferredItem      -0.64
Positive Logits
tagext             0.57
(no visible token) 0.55
<u>                0.49
’                  0.48
(no visible token) 0.47
paras              0.47
(no visible token) 0.47
(no visible token) 0.45
Kraj               0.45
(no visible token) 0.45
Activations Density 0.120%