INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    heit
    -0.68
    nio
    -0.67
    ة
    -0.65
    ing
    -0.64
     minY
    -0.62
    jboss
    -0.61
    ]';
    -0.60
    herme
    -0.59
     Réponses
    -0.57
    oneer
    -0.56
    POSITIVE LOGITS
    jména
    0.64
     propOrder
    0.62
    іга
    0.55
     ruolo
    0.55
     Wolken
    0.55
    topus
    0.55
    ruitment
    0.54
     vistos
    0.54
     asupra
    0.54
     ExecuteAsync
    0.54
    Act Density 0.124%

    No Known Activations