INDEX
    Explanations

    programming-related keywords and constructs in code documentation

    Negative Logits
    ``↵        -0.27
     */;↵      -0.26
    ');");↵    -0.25
    }}↵        -0.24
    ]-->↵      -0.23
    "}}↵       -0.23
    .'''↵      -0.23
     `;↵       -0.23
     "))↵      -0.23
    '];?>↵     -0.22
    Positive Logits
    )↵↵        0.40
    }↵↵        0.38
    ())↵↵      0.38
     "")↵↵     0.38
    )}↵↵       0.37
    "}↵↵       0.35
     |↵↵       0.35
    "]↵↵       0.35
    )]↵↵       0.35
    ']↵↵       0.35
    Activation Density: 0.514%

    No Known Activations