INDEX
    Explanations

    mathematical expressions and equations

    New Auto-Interp
    Negative Logits
    ,↵
    -0.17
    iв
    -0.16
    -0.15
    -0.15
    -↵
    -0.15
     --↵
    -0.14
    .,↵
    -0.14
    ;↵
    -0.14
    -0.14
    -0.14
    POSITIVE LOGITS
     \\
    0.28
     \$
    0.28
    \$
    0.26
    $↵↵
    0.23
     â̦↵↵
    0.23
     âĪĢ
    0.23
    ...\
    0.23
    \\
    0.22
     ...↵↵
    0.22
     ...\
    0.22
    Act Density 0.807%

    No Known Activations