INDEX
    Explanations

    references to plane incidents and safety concerns

    New Auto-Interp
    Negative Logits
    ãĥĨãĥ«
    -0.17
    owie
    -0.15
    ķĮ
    -0.15
     Charges
    -0.15
    flush
    -0.15
     gating
    -0.15
    éĺħ读次æķ°
    -0.15
     Charge
    -0.14
    zcze
    -0.14
     Morgan
    -0.14
    POSITIVE LOGITS
     engine
    0.17
     grounded
    0.16
     McDon
    0.15
     grounding
    0.15
    ras
    0.14
    *pow
    0.14
    AGR
    0.14
     https
    0.14
     dumps
    0.14
     dump
    0.14
    Act Density 0.025%

    No Known Activations