INDEX
    Explanations

    give an idea of understanding

    New Auto-Interp
    Negative Logits
     utilizza
    0.41
     experimented
    0.38
     utilizzare
    0.38
     utilice
    0.38
     utilisent
    0.37
     utilizando
    0.36
     Escherichia
    0.36
    itivos
    0.36
    經歷
    0.36
     utilizar
    0.36
    POSITIVE LOGITS
     understand
    1.57
     понять
    1.52
     understanding
    1.48
     capire
    1.45
    了解
    1.41
     понима
    1.37
     understands
    1.34
     hiểu
    1.34
     이해
    1.33
     понимать
    1.32
    Act Density 0.056%

    No Known Activations