INDEX
    Explanations

    references to health-related concepts and terminology

    technical or academic punctuation and citation markers.

    New Auto-Interp
    Negative Logits
    -0.60
    Tur
    -0.59
     iſt
    -0.59
     headlong
    -0.59
    𝙫
    -0.59
     Houſe
    -0.58
    δες
    -0.58
    bigoplus
    -0.58
    שְׁ
    -0.56
     Syr
    -0.56
    POSITIVE LOGITS
    ,-,
    1.42
    .$,
    1.28
    ′,
    1.24
     }}$,
    1.23
    ++,
    1.16
    °,
    1.16
    ,:),
    1.14
    €,
    1.13
     \%$,
    1.13
    .],
    1.13
    Act Density 2.637%

    No Known Activations