INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     genoux
    -0.63
     pauvres
    -0.54
     való
    -0.53
     blessés
    -0.51
     refroid
    -0.51
     détru
    -0.51
    gypti
    -0.51
    twimg
    -0.50
     jambes
    -0.49
     lèvres
    -0.49
    POSITIVE LOGITS
    Personensuche
    0.65
    AxisAlignment
    0.64
    astéroïdes
    0.64
     StatelessWidget
    0.59
    #+#
    0.59
    WithIOException
    0.59
    lessness
    0.58
     kaarangay
    0.57
    netics
    0.56
     Савезне
    0.55
    Act Density 0.001%

    No Known Activations