INDEX
    NEGATIVE LOGITS
    Token           Logit
    " Promotion"    -0.07
    " Calgary"      -0.07
    " maternity"    -0.06
    "Nr"            -0.06
    "ortion"        -0.06
    "_overlay"      -0.06
    " Друг"         -0.06
    "-pin"          -0.06
    " Rating"       -0.06

    POSITIVE LOGITS
    Token           Logit
    " внут"         0.07
    " envy"         0.06
    "κό"            0.06
    " berry"        0.06
    "rbrace"        0.06
    "stre"          0.06
    "insert"        0.06
    "alking"        0.06
    " beg"          0.06
    Activation Density: 0.006%

    No Known Activations