INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	ORDER
    -0.07
    мест
    -0.06
    	ref
    -0.06
     WORD
    -0.06
     Chairman
    -0.06
    (크기
    -0.06
    usat
    -0.06
     CircularProgress
    -0.06
    emás
    -0.06
    ampionship
    -0.06
    POSITIVE LOGITS
    _REQUIRE
    0.07
     Sci
    0.06
     vlak
    0.06
    						    
    0.06
     thu
    0.06
     environment
    0.06
     identifier
    0.06
    birds
    0.06
     Bulk
    0.06
    _general
    0.06
    Act Density 0.004%

    No Known Activations