INDEX
    Explanations

    phrases ending in a particular punctuation mark

    punctuation marks, specifically periods indicating the end of sentences

    Negative Logits
    " metic"      -0.66
    "anmar"       -0.58
    " IST"        -0.54
    "IAS"         -0.52
    "opter"       -0.52
    " artif"      -0.52
    " Kyoto"      -0.51
    " NYT"        -0.51
    " FB"         -0.51
    " Canaver"    -0.50
    Positive Logits
    (blank)            0.97
    "SPONSORED"        0.82
    "<|endoftext|>"    0.76
    "gard"             0.72
    "↵↵"               0.66
    "ppard"            0.66
    " "                0.57
    "Lt"               0.55
    "ii"               0.53
    " (*"              0.53
    Act Density 0.182%

    No Known Activations