INDEX
    Explanations

    occurrences of specific characters or symbols, particularly from non-Latin scripts

    New Auto-Interp
    Negative Logits
    KURZBESCHREIBUNG
    -0.54
     gynnwys
    -0.44
     Tecnología
    -0.42
     sonriente
    -0.40
     AssemblyCompany
    -0.40
     hospod
    -0.40
    -0.39
     Meksiku
    -0.39
    分別
    -0.37
     sostenibilidad
    -0.37
    POSITIVE LOGITS
     ج
    1.64
    ج
    1.41
     الج
    1.27
     ज
    0.96
     والج
    0.91
     ג
    0.89
    الج
    0.84
    ג
    0.77
     জ
    0.71
     j
    0.68
    Act Density 0.002%

    No Known Activations