INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     apparatus
    -0.07
     Industries
    -0.07
     تجاوز
    -0.07
     Mira
    -0.07
    -0.07
     surf
    -0.07
     pul
    -0.07
     kuch
    -0.07
    Mention
    -0.07
     seem
    -0.07
    POSITIVE LOGITS
     regard
    0.07
    mk
    0.07
    ellipse
    0.07
     conduc
    0.07
    kc
    0.07
    Tul
    0.07
     Recre
    0.07
     eun
    0.07
    0.07
    162
    0.07
    Act Density 0.234%

    No Known Activations