INDEX
    Explanations

    punctuation marks, specifically the period

    New Auto-Interp
    Negative Logits
    bob
    -0.15
     Moody
    -0.15
     Tel
    -0.15
    ulty
    -0.15
     Deborah
    -0.14
    à¥įवप
    -0.14
     Folding
    -0.14
    /Delete
    -0.14
     area
    -0.14
     Deb
    -0.14
    POSITIVE LOGITS
    çĵ¶
    0.19
     @}
    0.16
    ector
    0.15
    alt
    0.15
    Ľå»º
    0.14
    ffer
    0.14
    avier
    0.14
    دÛĮد
    0.14
    atten
    0.14
     éħ
    0.14
    Act Density 0.000%

    No Known Activations