    Negative Logits
    Token               Logit
    " BİL"              -0.06
    "icie"              -0.06
    "udic"              -0.06
    "dal"               -0.06
    "\Collections"      -0.06
    "loub"              -0.06
    " 아이"             -0.06
    " sagen"            -0.06
    "wrap"              -0.06
    (no token text)     -0.06
    Positive Logits
    Token               Logit
    " Beyond"           0.07
    ".Customer"         0.07
    "_categoria"        0.07
    " obrig"            0.06
    ".category"         0.06
    " Programming"      0.06
    "[random"           0.06
    " Moder"            0.06
    "Exporter"          0.06
    "者の"              0.06
    Activation Density: 0.022%

    No Known Activations