INDEX
    Explanations

    nouns and attributes that describe individuals, actions, and circumstances in crime news stories

    Negative Logits
    Token          Logit
    "asser"        -0.07
    "bine"         -0.07
    "Formatter"    -0.07
    "OfString"     -0.07
    "ンダ"         -0.07
    "erk"          -0.07
    "stype"        -0.07
    "utes"         -0.07
    "ульта"        -0.07
    "elere"        -0.07
    Positive Logits
    Token          Logit
    " man"         0.07
    " woman"       0.07
    "echan"        0.06
    " someone"     0.06
    " male"        0.06
    " young"       0.06
    "ged"          0.06
    "odka"         0.06
    " ac"          0.05
    " employee"    0.05
    Activation Density: 0.013%

    No Known Activations