INDEX
    Explanations

    references to changes, particularly in context to policy, documentation, and any specified numerical or categorical classifications

    New Auto-Interp
    Negative Logits
     المعيارى
    -0.45
     león
    -0.41
     desierto
    -0.39
     múltiple
    -0.38
     cív
    -0.38
     ceinture
    -0.38
     animación
    -0.38
     nativo
    -0.37
     abandonado
    -0.37
     inconnu
    -0.37
    POSITIVE LOGITS
    UserScript
    0.59
     Савезне
    0.59
    thâu
    0.53
     '\\;'
    0.52
    WriteTagHelper
    0.51
     beginnetje
    0.51
     Италијани
    0.51
    NameInMap
    0.51
    Chham
    0.50
    хьтан
    0.50
    Act Density 0.095%

    No Known Activations