INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     magistrate
    -0.07
    ocs
    -0.06
    setChecked
    -0.06
    ámara
    -0.06
    ��
    -0.06
     فرزند
    -0.06
    _Ptr
    -0.06
     volumes
    -0.06
    뉴스
    -0.06
    .Padding
    -0.06
    POSITIVE LOGITS
     interpersonal
    0.06
     pregnancies
    0.06
     milestone
    0.06
    0.06
     poking
    0.06
     poke
    0.06
     [])↵
    0.06
    Ga
    0.06
    تز
    0.06
    yssey
    0.06
    Act Density 0.002%

    No Known Activations