INDEX
    Explanations

    structurally defined elements in code, particularly those related to functions and blocks

    Negative Logits
    "_Texture"   -0.16
    " unseen"    -0.15
    " Mour"      -0.15
    " tslib"     -0.14
    " Gry"       -0.14
    "implify"    -0.14
    "useppe"     -0.14
    " dc"        -0.14
    "ome"        -0.14
    "erb"        -0.14
    Positive Logits
             
    0.25
               
    0.24
              
    0.22
    0.20
            
    0.19
                
    0.18
    aille
    0.18
    atham
    0.17
    adera
    0.16
                 
    0.15
    Act Density 0.057%

    No Known Activations