INDEX
    Explanations

    programming syntax elements and structures in the code, particularly brackets and statement terminators

    Code snippets or placeholders

    New Auto-Interp
    Negative Logits
    "):
    
    -0.84
    '):
    
    -0.82
     (?,
    -0.78
    <?
    
    -0.75
     —,
    -0.74
    ?',
    -0.74
    )"),
    -0.74
    __',
    -0.73
    {}",
    -0.73
    $',
    -0.73
    POSITIVE LOGITS
       
    0.77
    ↵↵↵
    0.76
      
    0.75
    ↵↵↵↵
    0.64
         
    0.64
    ↵↵↵↵↵
    0.62
           
    0.59
          
    0.57
        
    0.57
    0.56
    Act Density 0.119%

    No Known Activations