INDEX
    Explanations

    breaks and stopping points

    New Auto-Interp
    Negative Logits
    <Result
    -0.07
     unarmed
    -0.07
     calcular
    -0.06
            
    -0.06
    gra
    -0.06
     Liverpool
    -0.06
    charm
    -0.05
     Kar
    -0.05
    قلال
    -0.05
     přátel
    -0.05
    POSITIVE LOGITS
     Contin
    0.07
     захоп
    0.07
     getColumn
    0.06
    lamp
    0.06
     product
    0.06
    ufac
    0.06
     manière
    0.06
     yaz
    0.06
    emu
    0.06
     slick
    0.06
    Act Density 0.070%

    No Known Activations