INDEX
    Explanations

    elements that indicate potential errors or issues

    New Auto-Interp
    Negative Logits
    Geplaatst
    -0.95
    styleType
    -0.81
    twimg
    -0.72
    Revenir
    -0.72
     Paglinawan
    -0.70
    adaptiveStyles
    -0.68
    DrawerToggle
    -0.68
    igshid
    -0.68
    Архівовано
    -0.68
     presenti
    -0.67
    POSITIVE LOGITS
      
    0.73
    0.58
    ianum
    0.53
       
    0.51
     Mab
    0.49
    	
    0.48
     ​
    0.48
    xticks
    0.45
    ​​​​
    0.45
     quant
    0.45
    Act Density 0.447%

    No Known Activations