INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
    енью
    -0.07
     sublic
    -0.07
     vice
    -0.06
     Soviets
    -0.06
     Atkins
    -0.06
    Пос
    -0.06
     фрон
    -0.06
    цією
    -0.06
     деся
    -0.06
    POSITIVE LOGITS
    	Return
    0.07
    <Component
    0.07
    .id
    0.07
    ,double
    0.06
     Optional
    0.06
     linebacker
    0.06
     dag
    0.06
    remely
    0.06
     você
    0.06
    {id
    0.06
    Act Density 0.020%

    No Known Activations