INDEX
    Explanations

    phrases related to political controversy and accusations

    New Auto-Interp
    Negative Logits
     Diseases
    -0.16
     diseases
    -0.15
     Impl
    -0.15
    sez
    -0.14
     wars
    -0.14
     Fires
    -0.14
    ammable
    -0.13
     Bom
    -0.13
    launcher
    -0.13
    วà¸ģ
    -0.13
    POSITIVE LOGITS
     incident
    0.60
     event
    0.50
     episode
    0.47
    incident
    0.44
     Incident
    0.39
     encounter
    0.38
    äºĭä»¶
    0.37
     events
    0.36
     experience
    0.35
    episode
    0.35
    Act Density 0.280%

    No Known Activations