INDEX
    Explanations

    manslaughter

    New Auto-Interp
    Negative Logits
    uma
    -0.08
     уч
    -0.08
     Internal
    -0.08
    mall
    -0.07
     TPU
    -0.07
     infest
    -0.07
     vitt
    -0.07
    ്ദ
    -0.07
    uffling
    -0.07
     داخلی
    -0.07
    POSITIVE LOGITS
     TIM
    0.08
    Bent
    0.08
    0.08
     обяз
    0.08
     negligence
    0.08
     oversight
    0.08
     compassionate
    0.07
     honorable
    0.07
     Bent
    0.07
    В
    0.07
    Act Density 0.004%

    No Known Activations