    Negative Logits
     Drapeau        -0.73
    Revenir         -0.64
     BoxFit         -0.59
     Corona         -0.57
     ferons         -0.56
    GraphicsUnit    -0.55
    awtextra        -0.54
     corona         -0.53
    numerusform     -0.53
    oster           -0.53
    Positive Logits
    nito            0.54
    IEVE            0.52
    udia            0.52
    ніципалі        0.52
    ligent          0.52
     chronique      0.50
     digress        0.49
     Geben          0.49
    ocide           0.49
    Бележки         0.48
    Activation Density: 0.254%

    No Known Activations