INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Educ
    -0.07
     Серед
    -0.07
    ,'#
    -0.06
    libc
    -0.06
    ucursal
    -0.06
    *M
    -0.06
    494
    -0.06
    raud
    -0.06
    783
    -0.06
    ,'%
    -0.06
    POSITIVE LOGITS
     going
    0.20
     go
    0.18
     goes
    0.17
     went
    0.16
     Going
    0.14
     gone
    0.13
     Go
    0.13
    Going
    0.12
    Go
    0.12
    going
    0.11
    Act Density 0.074%

    No Known Activations