INDEX
Explanations
classifications and categories related to severity and impact assessments
New Auto-Interp
Negative Logits
amm
-0.14
FixedSize
-0.14
czy
-0.14
éĽĨä¸Ń
-0.14
arios
-0.13
younger
-0.13
.scalablytyped
-0.13
Laz
-0.13
Ini
-0.13
precisely
-0.13
POSITIVE LOGITS
medium
0.40
medium
0.38
Medium
0.37
Medium
0.37
moderate
0.34
Moderate
0.34
-medium
0.32
Moder
0.30
Moder
0.30
intermediate
0.29
Activations Density 0.166%