INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Liv
    -0.07
     statusBar
    -0.06
     Best
    -0.06
    Successfully
    -0.06
     supermarkets
    -0.06
    sworth
    -0.06
    larla
    -0.06
    .activity
    -0.06
    .Shape
    -0.06
    .qu
    -0.06
    POSITIVE LOGITS
    (bin
    0.08
    _INET
    0.07
    objc
    0.06
     analytic
    0.06
     decis
    0.06
     scrut
    0.06
    CellValue
    0.06
     breat
    0.06
     stunt
    0.06
     сбор
    0.06
    Act Density 0.005%

    No Known Activations