INDEX
    Explanations

    sentences that contain punctuation, specifically periods

    New Auto-Interp
    Negative Logits
    aar
    -0.16
    x
    -0.14
    ience
    -0.14
    compass
    -0.14
    emin
    -0.14
    aat
    -0.13
     Cain
    -0.13
    oms
    -0.13
     бÑĥдÑĤо
    -0.13
    pred
    -0.12
    POSITIVE LOGITS
    rient
    0.15
    ê¸ī
    0.15
    ıs
    0.15
    iversit
    0.15
    raq
    0.15
    odyn
    0.14
    shm
    0.14
    zeÅĦ
    0.13
    .her
    0.13
    agues
    0.13
    Act Density 0.017%

    No Known Activations