INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     confusion
    -0.07
     tijd
    -0.07
     hodin
    -0.06
    ('/')
    -0.06
     has
    -0.06
    WRAPPER
    -0.06
     have
    -0.06
    (rec
    -0.06
    "]);
    -0.06
     trousers
    -0.06
    POSITIVE LOGITS
    ¤¤
    0.07
    اخر
    0.06
    شو
    0.06
    ierce
    0.06
     Miracle
    0.06
    .O
    0.06
     приготов
    0.06
    .Configure
    0.06
    ket
    0.06
     Apply
    0.06
    Act Density 0.027%

    No Known Activations