INDEX
    NEGATIVE LOGITS
    Token           Logit
    "��"            -0.08
    " dịch"         -0.08
    " comprised"    -0.07
    " stepped"      -0.07
    " competent"    -0.07
    " piatta"       -0.07
    "gangen"        -0.07
    "	service"      -0.07
    " composed"     -0.07
    " perched"      -0.07
    POSITIVE LOGITS
    Token           Logit
    " Buh"          0.08
    " Dha"          0.08
    " Kap"          0.08
    "Ka"            0.08
    "Org"           0.07
    " Chá"          0.07
    " Chicken"      0.07
    "(region"       0.07
    " Бу"           0.07
    "řejmě"         0.07
    Activation Density: 0.001%

    No Known Activations