INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.06
     tờ
    -0.06
    Devices
    -0.06
     Wisdom
    -0.06
    reading
    -0.06
     Zionist
    -0.06
     oby
    -0.06
     ward
    -0.06
     knives
    -0.06
    istingu
    -0.06
    POSITIVE LOGITS
     Jan
    0.07
    .beginPath
    0.07
    	gl
    0.07
     isKindOfClass
    0.07
     قاب
    0.06
     Katie
    0.06
    드는
    0.06
     duel
    0.06
     startY
    0.06
    0.06
    Act Density 0.008%

    No Known Activations