Explanations
programming and programming-related constructs in the text
Negative Logits
tartalomajánló    -0.69
EClass            -0.68
verwijspagina     -0.67
μβρίου            -0.65
سكانية            -0.63
rungsseite        -0.63
мәкал             -0.63
≦)                -0.63
cookieParser      -0.62
antMatchers       -0.61
Positive Logits
<strong>          0.66
//                0.63
<b>               0.62
#                 0.61
()]               0.55
///               0.52
                  0.52
amint             0.50
[toxicity=0]      0.50
_                 0.50
Activations Density 0.099%