    NEGATIVE LOGITS
    Token         Logit
    " дней"       -0.07
    "oothing"     -0.07
    "MENTS"       -0.06
    " Colors"     -0.06
    "<input"      -0.06
    " Castle"     -0.06
    " místo"      -0.06
    "fclose"      -0.06
    "ucing"       -0.06
    " instit"     -0.06
    POSITIVE LOGITS
    Token         Logit
    " grabbed"    0.10
    " Grab"       0.09
    " grab"       0.09
    " grabbing"   0.08
    "Grab"        0.07
    "ongyang"     0.07
    " keywords"   0.07
    "anceled"     0.06
    "igation"     0.06
    ".machine"    0.06
    Activation Density: 0.007%

    No Known Activations