INDEX
    Explanations

    mentions of specific events or incidents

    occurrences of punctuation marks, particularly periods, indicating the end of sentences

    New Auto-Interp
    Negative Logits
     tremend
    -0.95
     subsidized
    -0.71
     harbor
    -0.70
     gobl
    -0.67
    oggles
    -0.66
     corrid
    -0.65
     leveled
    -0.65
     honoring
    -0.64
    ãĤ¼ãĤ¦ãĤ¹
    -0.64
     stabilization
    -0.64
    POSITIVE LOGITS
    1.30
    <|endoftext|>
    1.04
     ®
    1.03
     However
    0.96
     Pict
    0.95
     Whilst
    0.94
    ↵↵
    0.91
     Alternatively
    0.86
    Shape
    0.85
     Ministers
    0.84
    Act Density 0.297%

    No Known Activations