INDEX
    Explanations

    mentions of airline crashes and disasters

    New Auto-Interp
    Negative Logits
    ulumi
    -0.17
    urga
    -0.17
    phinx
    -0.15
    eum
    -0.15
    .Slf
    -0.15
    inou
    -0.14
     Threat
    -0.14
    hta
    -0.14
    جار
    -0.14
     Giov
    -0.14
    POSITIVE LOGITS
     crash
    0.22
     accident
    0.18
     fault
    0.18
    coll
    0.18
     disaster
    0.17
     loss
    0.17
    öh
    0.17
     coll
    0.17
     crashed
    0.16
     collision
    0.16
    Act Density 0.098%

    No Known Activations