    Negative Logits
    Token             Logit
    "Não"             -0.06
    " para"           -0.06
    " můžeme"         -0.06
    " случ"           -0.06
    "_Db"             -0.06
    " oppose"         -0.06
    "capacity"        -0.06
    "\tsb"            -0.06
    " acted"          -0.06
    " aden"           -0.06
    Positive Logits
    Token             Logit
    "adratic"         0.07
    "Miss"            0.06
    " complying"      0.06
    "_definitions"    0.06
    "Wik"             0.06
    "_mult"           0.06
    "-DD"             0.06
    "CE"              0.06
    "/arm"            0.06
    " yup"            0.06
    Activation Density: 0.000%

    No Known Activations