INDEX
    Explanations

    function, Role, agent, Spanish, English

    New Auto-Interp
    Negative Logits
    0.47
    alar
    0.46
    되는
    0.46
     Иванович
    0.45
     siblings
    0.45
    дары
    0.45
     captivating
    0.44
     looting
    0.44
     되는
    0.44
    aisi
    0.43
    POSITIVE LOGITS
    പരി
    0.50
    s
    0.50
     सालाना
    0.49
     Kishan
    0.48
     किशन
    0.47
     عيد
    0.47
     añade
    0.47
    iculate
    0.46
    űt
    0.46
     acepta
    0.46
    Act Density 0.000%

    No Known Activations