INDEX
    Explanations

    sections of text with no significant activations, indicating a lack of relevant content

    Non-English text and code snippets

    code, libraries, math, or programming context

    New Auto-Interp
    Negative Logits
    </h1>
    -0.84
    <b>
    -0.81
    </b>
    -0.80
      
    -0.79
    </h3>
    -0.77
    <strong>
    -0.73
    </h2>
    -0.64
    -0.61
    </strong>
    -0.60
    <u>
    -0.59
    POSITIVE LOGITS
     كومونز
    1.15
     الرياضيه
    1.12
     CreateTagHelper
    1.12
    TestingModule
    1.08
     '\\;'
    1.00
    Autoritní
    0.99
     незавершена
    0.99
     فريبيس
    0.99
    uxxxx
    0.94
    енча
    0.94
    Act Density 0.011%

    No Known Activations