INDEX
    Explanations

    mathematical symbols and variables within equations

    New Auto-Interp
    Negative Logits
    uario
    -0.15
    &#
    -0.15
    efa
    -0.14
    "+"
    -0.14
    anst
    -0.14
    ]+$
    -0.14
     };
    -0.14
     Bett
    -0.14
    }()↵
    -0.14
    &apos
    -0.14
    POSITIVE LOGITS
    )\
    0.49
    ]\
    0.46
    }\
    0.45
     }\
    0.39
    "\
    0.36
    >\
    0.36
     "\
    0.32
    `\
    0.31
    '\
    0.31
    ("\
    0.31
    Act Density 0.099%

    No Known Activations