INDEX
    Explanations

    sections where it encourages the reader to continue reading

    New Auto-Interp
    Negative Logits
    »Ĵ
    -0.85
    è¦ļéĨĴ
    -0.69
     Gleaming
    -0.68
    owl
    -0.64
    folk
    -0.61
    ription
    -0.60
    escription
    -0.59
    Downloadha
    -0.59
     Gamble
    -0.59
    ignt
    -0.58
    POSITIVE LOGITS
     Below
    0.74
     âĨĴ
    0.72
     isEnabled
    0.67
     BELOW
    0.64
    ...]
    0.64
     ARTICLE
    0.58
    below
    0.57
    hook
    0.57
    acters
    0.57
     ETH
    0.56
    Act Density 0.025%

    No Known Activations