INDEX
    Explanations

    references to the New York Times

    New Auto-Interp
    Negative Logits
    ьаж
    -0.91
    __*/
    -0.83
     Савезне
    -0.82
    adpleegd
    -0.80
     "..\..\
    -0.76
    styleUrls
    -0.75
    ibouti
    -0.72
    Geplaatst
    -0.70
    Hentet
    -0.68
    Ծանոթ
    -0.67
    POSITIVE LOGITS
     Times
    0.93
    Times
    0.85
     NYT
    0.84
    nytimes
    0.77
     TIMES
    0.74
    times
    0.71
     times
    0.61
    TIMES
    0.56
     Nymp
    0.56
     CUP
    0.52
    Act Density 0.009%

    No Known Activations