    Explanations

    results and their outcomes

    NEGATIVE LOGITS
     дві         0.63
    verted       0.60
    aktadır      0.58
     άλλο        0.58
    IB           0.57
     part        0.56
     metabolism  0.56
     cello       0.56
     όλα         0.56
    不允许       0.55
    POSITIVE LOGITS
     resultado   0.96
     obtenido    0.96
     resultados  0.93
     results     0.86
     Résultats   0.84
     Ergebnisse  0.83
    結果         0.82
    ant          0.80
    结果         0.78
     obtenidos   0.77
    Activation Density: 0.088%

    No Known Activations