INDEX
    Explanations

    repeated sequences of equal signs, likely for formatting or structural emphasis in code

    New Auto-Interp
    Negative Logits
    *********
    -0.99
    *************
    -0.96
    *********/
    -0.95
    ************
    -0.94
    ***********
    -0.94
    ********
    -0.90
    **********
    -0.88
     {*}
    -0.88
     }^{*}
    -0.88
    ***************
    -0.87
    POSITIVE LOGITS
    ================
    2.19
    ————————————————
    1.05
    ----------------
    1.04
    ~~~~~~~~~~~~~~~~
    0.86
    ################
    0.86
    ————————
    0.84
    ________________
    0.83
    ................
    0.83
    ▬▬▬▬▬▬▬▬
    0.78
    qu
    0.78
    Act Density 0.215%

    No Known Activations