INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    айд
    -0.06
    IData
    -0.06
    (Order
    -0.06
     مرگ
    -0.06
     exit
    -0.06
     кат
    -0.06
    Christ
    -0.06
    sans
    -0.05
     bystand
    -0.05
     size
    -0.05
    POSITIVE LOGITS
    0.07
     Calories
    0.07
     glazed
    0.07
     Magnum
    0.06
    elter
    0.06
    _DISABLE
    0.06
    iteration
    0.06
     Software
    0.06
     getters
    0.06
    .wrapper
    0.06
    Act Density 0.070%

    No Known Activations