INDEX
Explanations
references to terrorism and political violence
Non-English text and symbols
Negative Logits
®             -0.44
٪             -0.43
😡            -0.43
hsv           -0.41
!!)           -0.40
(“            -0.39
apparaître    -0.39
++;           -0.39
Unfortunately -0.38
!!            -0.38
Positive Logits
AndEndTag        0.99
Normdatei        0.96
RegistryLite     0.86
resourceCulture  0.86
tvguidetime      0.85
RTLR             0.84
曖昧さ回避        0.83
Rüyada           0.83
NDEBUG           0.79
Hochspringen     0.78
Activations Density: 0.125%