INDEX
    NEGATIVE LOGITS
     SharedPreferences    -0.07
    response              -0.07
    .te                   -0.06
     человечес            -0.06
     Rows                 -0.06
     Nigel                -0.06
    .Big                  -0.06
     случаев              -0.06
     Ragnar               -0.06
     товар                -0.06
    POSITIVE LOGITS
    ','','                0.07
     '↵                   0.06
    otence                0.06
     UP                   0.06
     "=                   0.06
     oslo                 0.06
     "                    0.06
    LOGY                  0.06
                          0.06
    >;↵                   0.06
    Activation Density: 0.004%

    No Known Activations