    Explanations

    Markers of highly structured or technical text (acronyms and abbreviations, tagging or format labels, numerals and dates, units, and code- or punctuation-heavy tokens) rather than ordinary prose.

    Negative Logits
    Token               Value
    " that"             0.29
    " hẳn"              0.27
    " ("                0.26
    " что"              0.25
    " άλλ"              0.25
    " Emeritus"         0.24
    " their"            0.24
    " বিকল্প"            0.24
    " Wellington"       0.24
    " Communications"   0.24
    Positive Logits
    Token               Value
    "ہار"               0.32
    " など"             0.31
    " definisi"         0.31
                        0.31
    " پانچ"             0.31
    " イベント"         0.30
                        0.30
    " letzte"           0.30
    " 제거"             0.30
    " verfol"           0.30
    Activation Density: 0.733%

    No Known Activations