INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
    relu
    -0.06
     Erg
    -0.06
    ceptors
    -0.06
    -0.06
     Against
    -0.06
     одним
    -0.06
     galer
    -0.06
    _pages
    -0.06
    swire
    -0.06
    POSITIVE LOGITS
     provoz
    0.08
     yapmaya
    0.07
     Ziel
    0.07
     suspicious
    0.06
     vascular
    0.06
     glazed
    0.06
    ExecutionContext
    0.06
     yazılı
    0.06
     Noble
    0.06
     yılı
    0.06
    Act Density 0.072%

    No Known Activations