INDEX
    Explanations

    occurrences of punctuation marks

    New Auto-Interp
    Negative Logits
    ech
    -0.15
    вано
    -0.14
    leet
    -0.14
    _cast
    -0.14
    ucus
    -0.14
    leur
    -0.13
    éĶĢ
    -0.13
    plusplus
    -0.13
    iel
    -0.13
    ione
    -0.13
    POSITIVE LOGITS
    ãĥĥãĥĦ
    0.17
     ÙħÛĮÙĦادÛĮ
    0.17
     æŃ£
    0.14
     cir
    0.14
    ÏĦιÏĥ
    0.14
    âĤ
    0.13
    andes
    0.13
    hone
    0.13
     ÃĤ
    0.13
    ningen
    0.13
    Act Density 0.042%

    No Known Activations