INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Work
    -0.07
    [curr
    -0.07
     해당
    -0.07
     این
    -0.07
    -0.07
    Extensions
    -0.07
    -0.06
     provider
    -0.06
     followers
    -0.06
     Searches
    -0.06
    POSITIVE LOGITS
     existe
    0.06
    一下
    0.06
     coleg
    0.06
     бактер
    0.06
     الى
    0.06
    0.06
    Located
    0.06
     různé
    0.06
    						   
    0.06
     quyền
    0.06
    Act Density 0.094%

    No Known Activations