INDEX
    Explanations

    references to the word "the."

    New Auto-Interp
    Negative Logits
    ContentAlignment
    -0.65
    '},
    
    -0.60
    TagMode
    -0.58
    )++;
    -0.54
    :]:
    -0.53
    ()]);
    -0.52
     snippetHide
    -0.51
     }^{*}
    -0.51
    Fprintf
    -0.50
     ')[
    -0.50
    POSITIVE LOGITS
     during
    0.98
     During
    0.98
    During
    0.96
    during
    0.95
     DURING
    0.87
     periods
    0.74
     period
    0.73
     tijdens
    0.73
     Periods
    0.70
    durante
    0.70
    Act Density 0.052%

    No Known Activations