INDEX
    Negative Logits
    Token               Logit
    "Measure"           -0.07
    " bombard"          -0.07
    " Shall"            -0.07
    " summ"             -0.07
    " shore"            -0.07
    " cav"              -0.07
    (blank)             -0.07
    " employ"           -0.07
    " shores"           -0.07
    " flavour"          -0.07
    Positive Logits
    Token               Logit
    "fatal"             0.10
    " schlimm"          0.10
    "严重"              0.10
    " devastated"       0.09
    " devastating"      0.09
    " fatal"            0.09
    " conséquences"     0.09
    " raakt"            0.09
    " consequências"    0.09
    "-neck"             0.09
    Activation Density: 0.020%

    No Known Activations