INDEX
    Explanations

    similar to, time, increase, how

    emphasized or standout key terms and headings in structured instructional text, especially those marked by formatting cues (bold/italics, quotes, slashes, or code-style tokens).

    New Auto-Interp
    Negative Logits
     and
    0.29
     to
    0.28
    ,
    0.28
     or
    0.25
     و
    0.25
     œuvres
    0.25
     -
    0.25
     (
    0.24
    0.24
       
    0.23
    POSITIVE LOGITS
    <unused1861>
    0.28
    <unused742>
    0.24
    <unused2037>
    0.23
    <unused717>
    0.23
    their
    0.23
    <unused321>
    0.23
    <unused1774>
    0.23
    <unused661>
    0.22
    AMP
    0.22
    <unused1658>
    0.22
    Act Density 2.071%

    No Known Activations