INDEX
    Explanations

    phrases related to news headlines, including topics like political activities, medical incidents, criminal actions, and official statements

    instances of events or situations involving significant harm or danger

    New Auto-Interp
    Negative Logits
    iton
    -0.65
    в
    -0.63
     ACS
    -0.63
     life
    -0.63
    tons
    -0.62
    hement
    -0.60
    pps
    -0.60
    hest
    -0.58
    д
    -0.58
    ean
    -0.58
    POSITIVE LOGITS
    ccording
    0.77
    EPA
    0.70
    BBC
    0.69
    rouse
    0.65
     Prosecutors
    0.64
    inav
    0.62
    accompan
    0.62
    ³³³³³³³³³³³³³³³³
    0.60
    Prosecut
    0.60
     prosecutor
    0.59
    Act Density 0.131%

    No Known Activations