INDEX
    Explanations

    presence of specific characters or symbols, particularly the character "ðŁ."

    New Auto-Interp
    Negative Logits
    unate
    -0.16
    寸
    -0.15
    à¹Ĥย
    -0.15
    readcr
    -0.14
     MetroFramework
    -0.14
    aviest
    -0.14
     Virt
    -0.14
    iddi
    -0.13
    .preference
    -0.13
     ÙĨزدÛĮÚ©
    -0.12
    POSITIVE LOGITS
     by
    0.50
    by
    0.36
     oleh
    0.30
    _by
    0.29
    .by
    0.29
     By
    0.27
     تÙĪØ³Ø·
    0.25
    -by
    0.25
     considerably
    0.24
     çͱ
    0.24
    Act Density 0.012%

    No Known Activations