INDEX
    Explanations
    Negative Logits
    " Playboy"         -0.08
    "yaw"              -0.07
    " nová"            -0.07
    " πιο"             -0.07
    " Cata"            -0.07
    "BTTagCompound"    -0.07
    "ValueType"        -0.06
    "	arg"            -0.06
    "보내기"            -0.06
    "すれば"            -0.06
    Positive Logits
    " su"              0.06
    (token not shown)  0.06
    " doctors"         0.06
    " photograph"      0.06
    " *}"              0.06
    " коміс"           0.06
    " contributions"   0.06
    " z"               0.06
    "_guess"           0.06
    (token not shown)  0.06
    Act Density 0.002%

    No Known Activations