INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	The
    -0.07
    captcha
    -0.06
     Для
    -0.06
    ________________________________
    -0.06
    (NUM
    -0.06
    .attach
    -0.06
     Harrison
    -0.06
     Hanging
    -0.06
     closes
    -0.06
     SCRIPT
    -0.06
    POSITIVE LOGITS
    kont
    0.07
    pix
    0.07
     /
    0.06
     mohlo
    0.06
     modeled
    0.06
     расч
    0.06
    مد
    0.06
    рез
    0.06
    oretical
    0.06
     veut
    0.06
    Act Density 0.003%

    No Known Activations