INDEX
    Explanations

    repeating patterns or symbols

    Negative Logits
    +#+#
    -1.06
     ffilmiau
    -0.93
    %</
    -0.88
    PreferredItem
    -0.88
     HasFactory
    -0.88
    '),
    
    -0.83
    .",
    
    -0.83
    ImageContext
    -0.82
     cherchés
    -0.82
    QMetaType
    -0.82
    Positive Logits
      
    0.94
       
    0.67
    ↵↵
    0.56
          
    0.56
     ‏
    0.54
    ↵↵↵
    0.54
    0.54
    ↵↵↵↵↵
    0.51
           
    0.49
                  
    0.49
    Act Density 0.183%

    No Known Activations