INDEX
    Explanations

    instances of the word "different."

    Negative Logits
    " vectorielle"     -0.99
    " mijne"           -0.92
    " nucléaire"       -0.85
    " picioare"        -0.82
    " umane"           -0.82
    " étoient"         -0.82
    " prenez"          -0.80
    " sólidos"         -0.78
    " poussière"       -0.78
    " vectorielles"    -0.78
    Positive Logits
    " different"     2.23
    "Different"      2.06
    " Different"     1.99
    "different"      1.99
    " DIFFERENT"     1.80
    " difer"         1.48
    " diferente"     1.45
    " diferentes"    1.40
    "不同"           1.39
    "不同的"         1.31
    Act Density 0.163%

    No Known Activations