    Negative Logits
    " hoc"          -0.07
    " Sec"          -0.07
    "\tcall"        -0.07
    " mocker"       -0.07
    " mam"          -0.07
    "Ticker"        -0.07
    " اسلامی"       -0.07
    " Nep"          -0.07
    "Camera"        -0.07
    " Telescope"    -0.07

    Positive Logits
    " width"        0.10
    " Width"        0.09
    "Width"         0.09
    " Midwest"      0.09
    "-width"        0.09
    "\twidth"       0.09
    "_WIDTH"        0.08
    "win"           0.08
    "w"             0.08
    ".width"        0.08

    Act Density 0.016%

    No Known Activations