INDEX
    Explanations
    Negative Logits
    Token         Logit
    " prév"       -0.09
    " moyen"      -0.09
    "ейт"         -0.09
    "осред"       -0.08
    " parkeer"    -0.08
    " minister"   -0.08
    " atr"        -0.08
    "окой"        -0.08
    "_when"       -0.08
    " eid"        -0.08

    Positive Logits
    Token         Logit
    "igh"         0.19
    "IGH"         0.11
    "igth"        0.11
    "ighed"       0.10
    "omach"       0.09
    "igt"         0.09
    "ighe"        0.08
    " Pant"       0.08
    "ighth"       0.08
    "rust"        0.08
    Activation Density: 0.002%

    No Known Activations