INDEX
    Explanations

    code structure elements, specifically related to control flow and method invocations in programming languages

    New Auto-Interp
    Negative Logits
    231
    -0.15
    illard
    -0.15
    âij
    -0.15
    234
    -0.15
    29
    -0.15
     Nut
    -0.14
    amba
    -0.14
     Priv
    -0.14
    69
    -0.14
    9
    -0.14
    POSITIVE LOGITS
                           
    0.48
    	                   
    0.25
                          
    0.23
    0.23
                            
    0.23
    		               
    0.21
    ↵                        ↵
    0.20
    			           
    0.20
                            č↵
    0.20
    					
    0.20
    Act Density 0.024%

    No Known Activations