INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     glaucoma
    -0.08
     Gross
    -0.08
    (JS
    -0.07
    (net
    -0.07
    cloth
    -0.07
    كه
    -0.07
     withstand
    -0.07
    alik
    -0.07
     Smaller
    -0.07
    Gross
    -0.07
    POSITIVE LOGITS
    omon
    0.08
    0.08
    0.08
    0.08
    Neighbors
    0.07
     arkadaş
    0.07
     Nicole
    0.07
    েরা
    0.07
    -element
    0.07
     элемента
    0.07
    Act Density 0.007%

    No Known Activations