INDEX
    Explanations

    references to partisan content or affiliations

    NEGATIVE LOGITS
     externi
    -0.68
     يتيمه
    -0.68
    ThroughAttribute
    -0.63
    קישורים
    -0.57
     NSCoder
    -0.56
     &___
    -0.56
    -0.56
    NameInMap
    -0.52
     estimés
    -0.51
    دانشنامهٔ
    -0.51
    POSITIVE LOGITS
    cshtml
    1.23
    partisan
    0.79
    ynos
    0.75
    imetry
    0.73
    \{(
    0.71
    JspWriter
    0.70
    πή
    0.69
    wsj
    0.69
    OutputType
    0.68
    ScopeManager
    0.68
    Activation Density 0.085%

    No Known Activations