INDEX
    Explanations

    instances of structured data formats or programming constructs

    Code, parentheses, brackets, and other symbols

    New Auto-Interp
    Negative Logits
    RenderAtEndOf
    -1.14
    IntoConstraints
    -1.12
    <unused79>
    -1.02
    <unused41>
    -1.02
    <unused52>
    -1.02
    <unused14>
    -1.02
    <unused16>
    -1.02
    <unused8>
    -1.02
    [@BOS@]
    -1.02
    <unused3>
    -1.02
    POSITIVE LOGITS
    ↵↵
    0.78
    <eos>
    0.71
    0.56
    .
    0.54
    ↵↵↵
    0.54
    2
    0.51
    1
    0.47
    The
    0.43
    3
    0.43
    <em>
    0.41
    Act Density 0.730%

    No Known Activations