    Explanations

    punctuation marks, particularly periods

    Negative Logits
    Token           Logit
    "lear"          -0.07
    (whitespace)    -0.07
    "inand"         -0.06
    "oader"         -0.06
    "-UA"           -0.05
    "öl"            -0.05
    " Sys"          -0.05
    "¶"             -0.05
    "lete"          -0.05
    " SYS"          -0.05
    Positive Logits
    Token           Logit
    " recently"     0.08
    " Recently"     0.08
    " sometimes"    0.08
    " lately"       0.08
    " Yet"          0.07
    "ÐIJÑĢÑħÑĸв"    0.07
    " begs"         0.07
    "dden"          0.07
    "recent"        0.07
    " BUT"          0.07
    Activation Density: 0.048%
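For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed: the fraction of token positions on which the feature's activation exceeds a threshold. This is an illustration under stated assumptions, not the dashboard's actual method. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical.

```python
from typing import Iterable


def activation_density(activations: Iterable[float], threshold: float = 0.0) -> float:
    """Return the fraction of values strictly above `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires. The source does not define the term.
    """
    total = 0
    active = 0
    for value in activations:
        total += 1
        if value > threshold:
            active += 1
    if total == 0:
        raise ValueError("need at least one activation value")
    return active / total


# Hypothetical data: 100,000 token positions, with the feature firing on 48.
sample = [0.0] * 99_952 + [1.5] * 48
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.048%
```

Under this reading, a density of 0.048% corresponds to the feature firing on roughly 48 of every 100,000 tokens.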

    No Known Activations