INDEX
    Explanations

    repeated occurrences of the word "the"

    New Auto-Interp
    Negative Logits
    -1.21
     propOrder
    -1.06
     wikipagina
    -0.95
     snippetHide
    -0.90
     ویکی‌پدیای
    -0.87
     nakalista
    -0.86
    ^(@)
    -0.86
     Roskov
    -0.85
    OOTDTY
    -0.85
     myſelf
    -0.84
    POSITIVE LOGITS
    the
    2.59
    The
    1.97
    THE
    1.84
     THE
    1.61
     The
    1.41
     the
    1.31
    thes
    1.21
    ethe
    1.09
    ithe
    1.08
    sthe
    1.00
    Act Density 0.101%

    No Known Activations