INDEX
    Explanations

    words related to statements or claims being made

    statements or comments made by individuals in a reporting context

    NEGATIVE LOGITS
    EDIT        -0.71
    irs         -0.66
    ktop        -0.64
    ï¸          -0.64
    Justice     -0.62
    soType      -0.60
    xtap        -0.60
    gencies     -0.60
    terness     -0.60
     GOODMAN    -0.60
    POSITIVE LOGITS
     "[               0.91
     "…               0.88
     omin             0.75
     bluntly          0.72
     '[               0.72
     aloud            0.72
     "#               0.70
     publicly         0.68
     underestimated   0.66
     Saddam           0.65
    Act Density 0.389%

    No Known Activations