INDEX
    Explanations

    Tokens that denote structured technical identifiers or labels (such as IDs, variable/field names, and separator punctuation) within code-like or formatted lists.

    Negative Logits
     ग्रह
    0.42
    stro
    0.41
     بدء
    0.41
    denes
    0.40
     }{}_{\
    0.38
     فاط
    0.37
    combination
    0.37
     तेज़
    0.36
     WLR
    0.36
     מצ
    0.36
    Positive Logits
     logo
    0.39
     explicitly
    0.39
     doc
    0.37
     artic
    0.37
     viol
    0.34
     explicit
    0.34
    DOCTYPE
    0.34
     logos
    0.34
     navy
    0.33
    logo
    0.33
    Act Density 0.011%

    No Known Activations