INDEX
    Explanations

    references to sections and theorems in a mathematical context

    New Auto-Interp
    Negative Logits
    akit
    -0.17
    .metro
    -0.15
    richt
    -0.15
    byn
    -0.14
     ``(
    -0.13
    ([(
    -0.13
    olo
    -0.13
    lass
    -0.13
     Bolt
    -0.13
    _SHADOW
    -0.13
    POSITIVE LOGITS
     \
    0.25
    -\
    0.20
     LABEL
    0.20
    ~
    0.20
     cref
    0.20
     \@
    0.17
     Demir
    0.17
    (\
    0.16
    oref
    0.16
     labeled
    0.15
    Act Density 0.046%

    No Known Activations