INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Bourne
    -0.73
    box
    -0.70
    Preference
    -0.68
    meras
    -0.67
     available
    -0.67
     Igor
    -0.67
    -0.67
     stimulated
    -0.67
    ociação
    -0.67
    uing
    -0.66
    POSITIVE LOGITS
    tray
    0.69
    Tray
    0.67
     MRP
    0.66
     RouterModule
    0.65
    Género
    0.63
     офи
    0.63
    ffd
    0.63
    gräns
    0.62
     Keine
    0.61
     coiff
    0.61
    Act Density 0.064%

    No Known Activations