    Negative Logits
    Token           Logit
    " private"      -1.70
    " privately"    -1.61
    "private"       -1.55
    " privados"     -1.50
    " privado"      -1.46
    " privati"      -1.46
    " privés"       -1.45
    " privat"       -1.42
    " PRIVATE"      -1.41
    " privée"       -1.41
    Positive Logits
    Token           Logit
    "ist"           0.59
    " sector"       0.58
    "id"            0.55
    "ia"            0.52
    "is"            0.50
    "li"            0.49
    "2"             0.48
    "de"            0.47
    "ili"           0.46
    "ef"            0.46
    Activation Density: 0.280%

    No Known Activations