INDEX
    Explanations

    proper nouns, specifically names of individuals and their affiliations in news articles

    New Auto-Interp
    Negative Logits
     newspapers
    -0.45
     AspNetCore
    -0.45
     ujednoznacz
    -0.42
    ]-->
    -0.42
    Eloquent
    -0.41
    TextSpan
    -0.41
    gonic
    -0.41
    참고
    -0.40
    WithEmail
    -0.40
     Anya
    -0.40
    POSITIVE LOGITS
    Tikang
    0.73
    transQ
    0.72
    قایناقلار
    0.67
    "])
    
    0.63
    ništ
    0.61
     '\\;'
    0.59
    RegressionTest
    0.58
     समीक्षाओं
    0.57
    }))
    
    0.56
     <=",
    0.56
    Act Density 0.022%

    No Known Activations