INDEX
    Explanations

    words near the beginning of documents

    Hexadecimal representations

    numbers followed by hex codes

    Negative Logits
    Token               Logit
    ,                   -0.48
    Ď                   -0.47
    inty                -0.42
    arde                -0.41
    Ή                   -0.40
     essentially        -0.39
    Cre                 -0.39
    𝐠                   -0.38
    .                   -0.38
    Very                -0.38
    Positive Logits
    Token               Logit
    <bos>               2.91
                        1.20
    '                   1.12
     Савезне            1.06
    хьтан               1.00
     disambiguazione    0.98
    DockStyle           0.93
    ագրություններ       0.92
     виправивши         0.91
    Diwedd              0.91
    Activation Density 0.022%

    No Known Activations