INDEX
    Explanations

    function definitions and object-oriented programming structures

    New Auto-Interp
    Negative Logits
    (",")↵
    -0.16
    ()↵↵
    -0.15
    []↵
    -0.14
    ;",↵
    -0.14
     bureaucr
    -0.14
    [:]↵
    -0.14
     Golden
    -0.14
    era
    -0.14
    .@
    -0.14
    (',')↵
    -0.14
    POSITIVE LOGITS
    ):↵
    0.58
     ):↵
    0.47
    ):↵↵
    0.46
    "):↵
    0.44
    ]:↵
    0.43
    '):↵
    0.42
    ']:↵
    0.40
    "]:↵
    0.39
    ]):↵
    0.39
    :↵
    0.38
    Act Density 0.010%

    No Known Activations