INDEX
    Explanations

    lists or examples outlined in a structured format

    instances of self-reflection or examples that illustrate a point

    New Auto-Interp
    Negative Logits
    etheless
    -0.60
    sec
    -0.59
    )))
    -0.57
     sqor
    -0.56
    isine
    -0.56
    ascus
    -0.54
    VERTISEMENT
    -0.53
    clerosis
    -0.53
    ]).
    -0.52
    ])
    -0.52
    POSITIVE LOGITS
     vowel
    0.52
     versus
    0.52
     hypothetical
    0.51
     Slate
    0.49
     divided
    0.47
     randomly
    0.46
     compute
    0.46
     perceptual
    0.45
     subdiv
    0.45
     subjective
    0.45
    Act Density 2.289%

    No Known Activations