INDEX
    Explanations

    mathematical or tabular representations of data and equations

    Mathematical/scientific notation and symbols

    mathematical expressions, symbols, headers, copyright

    New Auto-Interp
    Negative Logits
      
    -0.99
       
    -0.82
    -0.74
     });
    
    -0.71
        
    -0.68
     ?>
    
    -0.68
     ");
    
    -0.66
     "))
    -0.66
     );
    
    -0.65
     })
    
    -0.65
    POSITIVE LOGITS
    	
    0.75
    0.73
    		
    0.66
    --
    0.63
    "
    0.61
    </
    0.60
    -->
    0.60
    ↵↵↵
    0.60
    <
    0.59
    					
    0.58
    Act Density 1.246%

    No Known Activations