INDEX
    NEGATIVE LOGITS
    Logit   Token
    -0.08   " آل"
    -0.07   "%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%"
    -0.07   "_Manager"
    -0.07
    -0.07   "αλ"
    -0.07   "FragmentManager"
    -0.06   " Enough"
    -0.06   " RA"
    -0.06   "кор"
    -0.06   " ################################################"

    POSITIVE LOGITS
    Logit   Token
    0.07    "-bordered"
    0.07    " newList"
    0.06    "idelity"
    0.06    "uze"
    0.06    "idak"
    0.06
    0.06    "systems"
    0.06    "(include"
    0.06    "vention"
    0.06    " [&"
    Activation density: 0.056%

    No Known Activations