INDEX
Explanations
references to legal concepts, particularly related to jury instructions and evidence
New Auto-Interp
Negative Logits
%@",
-0.53
+"/"+
-0.47
})*/
-0.46
<mask>
-0.46
">'+
-0.46
)=>{-0.46
<?=$
-0.45
+"_
-0.45
});*/
-0.44
/**
-0.44
POSITIVE LOGITS
[toxicity=0]
0.72
0.63
initComponents
0.54
consultato
0.54
betweenstory
0.53
resourceCulture
0.52
ViewImports
0.51
parsedMessage
0.50
UnusedPrivate
0.48
négatif
0.47
Activations Density 0.013%