INDEX
    Explanations

    punctuation marks and their variation

    New Auto-Interp
    Negative Logits
    èĤ¯
    -0.15
     Ting
    -0.15
    ru
    -0.14
    NU
    -0.14
     automat
    -0.14
     Pik
    -0.14
     resume
    -0.13
    abytes
    -0.13
     fr
    -0.13
     Normal
    -0.13
    POSITIVE LOGITS
    oure
    0.16
    @show
    0.16
    alus
    0.15
    agne
    0.15
    OKIE
    0.15
    rå
    0.15
    /vnd
    0.14
    yro
    0.14
    emble
    0.14
    fty
    0.14
    Act Density 0.003%

    No Known Activations