INDEX
    Explanations
    Negative Logits
     migrant    -0.07
     track    -0.07
     допомоги    -0.07
     مار    -0.07
    	sw    -0.07
    운데    -0.06
     Dil    -0.06
    About    -0.06
    _ZERO    -0.06
     leopard    -0.06
    Positive Logits
    Register    0.07
    ,↵    0.06
    SCREEN    0.06
    _ENV    0.06
     PartialEq    0.06
    :%    0.06
    час    0.06
    وی    0.06
     im    0.06
    []={    0.06
    Act Density: 0.031%

    No Known Activations