INDEX
    Explanations

    patterns of repeated characters or symbols in sequences

    repeated sequences or patterns in the text

    New Auto-Interp
    Negative Logits
      
    -1.20
       
    -1.02
         
    -0.84
        
    -0.82
     ”
    -0.82
     )
    -0.79
          
    -0.78
     ?
    -0.78
    -0.78
    -0.77
    POSITIVE LOGITS
    	
    0.87
     photolibrary
    0.82
    		
    0.75
    0.71
    			
    0.68
     myſelf
    0.66
     ARXIV
    0.65
    ↵↵
    0.64
     defaultstate
    0.63
    				
    0.61
    Act Density 1.298%

    No Known Activations