INDEX
    Explanations

    names of individuals involved in news stories or events

    proper nouns, particularly names of individuals mentioned in criminal contexts

    New Auto-Interp
    Negative Logits
     mathemat
    -0.82
     guaranteeing
    -0.81
    cffff
    -0.81
     optim
    -0.77
     forecasting
    -0.75
     bookmark
    -0.74
     charism
    -0.73
    pmwiki
    -0.72
     pse
    -0.71
     conclud
    -0.70
    POSITIVE LOGITS
     Doe
    1.24
     Tsarnaev
    1.07
     Zimmerman
    1.00
     Bundy
    0.96
     Hernandez
    0.92
     Ramirez
    0.92
     Paddock
    0.91
     Martinez
    0.87
     Rodriguez
    0.87
     Nguyen
    0.86
    Act Density 0.428%

    No Known Activations