INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     aspecto
    0.39
     aspetto
    0.38
    0.35
     bella
    0.35
     ecc
    0.34
    component
    0.33
     blij
    0.33
     aspectos
    0.33
    aspect
    0.33
     aspect
    0.32
    POSITIVE LOGITS
     answer
    0.74
     response
    0.67
    resposta
    0.64
    回答
    0.63
     답변
    0.61
     reply
    0.60
     आंसर
    0.60
    answer
    0.60
     ответы
    0.60
     réponse
    0.59
    Act Density 0.009%

    No Known Activations