    Explanations

    references to individuals, specifically names or titles

    Negative Logits
     nahilalakip        -1.04
    AddTagHelper        -0.88
    لينكات              -0.78
     CanadaChoose       -0.76
     AssemblyCulture    -0.72
    قایناقلار           -0.70
    Enllaces            -0.68
    TagMode             -0.68
    icolon              -0.68
    unhofer             -0.68
    Positive Logits
     iprot              0.54
     faute              0.49
    BindView            0.44
    None                0.44
    tex                 0.43
     bì                 0.42
    nytimes             0.40
    が進                0.39
     resim              0.39
     NYT                0.38
    Activation Density: 0.106%

    No Known Activations