    Negative Logits
    " 이를"      -0.07
    (blank)     -0.07
    "stored"    -0.07
    " pay"      -0.07
    "orr"       -0.07
    " ure"      -0.07
    " hoja"     -0.07
    " же"       -0.07
    (blank)     -0.07
    " sheath"   -0.07
    Positive Logits
    " vandaan"   0.09
    " قرار"      0.09
    " रखने"      0.08
    " estamos"   0.08
    "قرر"        0.08
    " campanha"  0.08
    " કરવા"      0.08
    " करने"      0.08
    " pertenc"   0.08
    "|-"         0.07
    Activation Density: 0.157%

    No Known Activations