INDEX
    Explanations

    punctuation marks

    sentences ending with a period

    New Auto-Interp
    Negative Logits
     withd
    -0.98
     manif
    -0.80
     inver
    -0.79
     cabbage
    -0.75
     brut
    -0.74
     prototyp
    -0.74
     scaling
    -0.73
    onga
    -0.70
     challeng
    -0.70
     listeners
    -0.70
    POSITIVE LOGITS
     Retrieved
    1.55
    jpg
    1.46
     Accessed
    1.32
    png
    1.25
    htm
    1.17
    txt
    1.12
    pdf
    1.11
    zip
    1.10
    exe
    1.10
    wav
    1.08
    Act Density 0.298%

    No Known Activations