INDEX
    Explanations

    categorize and present breakdowns

    meta-discourse that organizes an answer—introductions to “breakdowns” or lists, categorizations, section headings, and disclaimers that frame and structure the response.

    New Auto-Interp
    Negative Logits
     নাকি
    0.24
    或者是
    0.22
     during
    0.22
     unsuccessfully
    0.22
     physically
    0.22
     ethylene
    0.22
     দেখেছিলেন
    0.22
     interference
    0.21
     water
    0.21
     occurred
    0.21
    POSITIVE LOGITS
     постара
    0.39
     bahsede
    0.37
     максимально
    0.36
     подробно
    0.36
    aremos
    0.35
     yapacağız
    0.35
     시작하겠습니다
    0.34
     başlayalım
    0.33
    ujourd
    0.32
    分為
    0.32
    Act Density 5.985%

    No Known Activations