    Negative Logits
    Token           Logit
    jette           -0.65
    reconnaît       -0.63
    croit           -0.61
    sienta          -0.59
    envoie          -0.59
    révèle          -0.58
    applicazioni    -0.57
    encuentre       -0.57
    inclut          -0.56
    mantenga        -0.56
    Positive Logits
    Token           Logit
    unve            0.94
    jorge           0.93
    sergio          0.92
    suscep          0.91
    viciss          0.91
    accla           0.87
    jake            0.86
    mattel          0.86
    versace         0.85
    inconce         0.85
    Activation Density: 0.420%

    No Known Activations