    Explanations

    references to small-scale entities or concepts

    Negative Logits
    Token                 Logit
    "uvo"                 -0.68
    " obligé"             -0.68
    "גרת"                 -0.67
    " ISSUED"             -0.66
    " Réponses"           -0.66
    " lusso"              -0.65
    "eraard"              -0.64
    " suivantes"          -0.64
    " touristique"        -0.64
    " Autorizaciones"     -0.64

    Positive Logits
    Token                 Logit
    " Small"              1.70
    " SMALL"              1.66
    "Small"               1.65
    " small"              1.65
    "small"               1.57
    "SMALL"               1.51
    " smal"               1.48
    " Smal"               1.32
    "Kleine"              1.16
    " kleinen"            1.10
    Act Density 0.076%

    No Known Activations