INDEX
    Explanations

    blocks of code or structured data formats

    New Auto-Interp
    Negative Logits
    Knife
    -0.16
    egan
    -0.16
    adx
    -0.16
     Knife
    -0.15
    567
    -0.15
             
    -0.15
    agan
    -0.15
            
    -0.14
     happiness
    -0.14
     Kag
    -0.14
    POSITIVE LOGITS
                               
    0.42
                              
    0.30
                             
    0.27
                                
    0.27
    19
    0.23
                            
    0.23
    						
    0.22
    ****************************
    0.22
     						
    0.21
                           
    0.21
    Act Density 0.018%

    No Known Activations