INDEX
    Explanations

    occurrences of specific formatting or annotation patterns in code comments

    New Auto-Interp
    Negative Logits
    abile
    -0.15
    jn
    -0.15
    hic
    -0.14
    219
    -0.14
    ahoma
    -0.14
    ase
    -0.13
    kám
    -0.13
    wav
    -0.13
    Hell
    -0.13
    inent
    -0.13
    POSITIVE LOGITS
     @
    0.18
    @g
    0.16
    aint
    0.15
    version
    0.15
    /{{
    0.14
    ingleton
    0.14
     version
    0.14
     Roh
    0.14
    lich
    0.13
     Smy
    0.13
    Act Density 0.005%

    No Known Activations