    NEGATIVE LOGITS
     sábado         -0.07
     delimited      -0.07
     yasal          -0.06
     birinin        -0.06
     millenn        -0.06
     shave          -0.06
     antivirus      -0.06
     konum          -0.06
    .description    -0.06
    šla             -0.06
    POSITIVE LOGITS
     caught         0.13
     catching       0.11
     catches        0.10
     catcher        0.09
     Caught         0.09
    Caught          0.09
     catch          0.08
     цей            0.07
    -catching       0.07
     Respond        0.07
    Activation Density: 0.008%

    No Known Activations