INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
    .Order
    -0.06
    лод
    -0.06
    اي
    -0.06
     impro
    -0.06
     explosive
    -0.06
    (defun
    -0.06
    ัพท
    -0.06
    ух
    -0.06
    bz
    -0.06
    POSITIVE LOGITS
    міністра
    0.07
    (IService
    0.07
    .readline
    0.06
     FormsModule
    0.06
    ména
    0.06
     innovation
    0.06
     campaigns
    0.06
    (node
    0.06
     inequality
    0.06
     Pearl
    0.06
    Act Density 0.039%

    No Known Activations