INDEX
    Explanations
    NEGATIVE LOGITS
    Token            Logit
    (blank)          -0.07
    "VC"             -0.06
    " ordinance"     -0.06
    " внутрен"       -0.06
    "<Test"          -0.06
    " Visibility"    -0.06
    "experience"     -0.06
    "vc"             -0.06
    "807"            -0.06
    " 역사"          -0.06
    POSITIVE LOGITS
    Token            Logit
    " Users"         0.07
    "84"             0.07
    (blank)          0.07
    "UDP"            0.07
    " alleges"       0.07
    " Trim"          0.07
    " users"         0.06
    " анти"          0.06
    "bee"            0.06
    " слож"          0.06
    Activation Density: 0.003%

    No Known Activations