INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     changing
    -0.07
    -cloud
    -0.07
     Mojo
    -0.07
     change
    -0.07
     Vance
    -0.06
    dddd
    -0.06
    _group
    -0.06
     Change
    -0.06
    ché
    -0.06
     امر
    -0.06
    POSITIVE LOGITS
     fatal
    0.15
     fatally
    0.12
     Fatal
    0.12
    Fatal
    0.10
     fatalities
    0.09
    fatal
    0.08
    .Fatal
    0.08
     rentals
    0.07
     lethal
    0.07
    tel
    0.07
    Act Density 0.006%

    No Known Activations