INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     NPs
    0.68
    ibo
    0.66
     تیاری
    0.65
    ndef
    0.64
    0.63
    fate
    0.63
    oot
    0.62
    ococ
    0.62
     nito
    0.62
    cribes
    0.62
    POSITIVE LOGITS
    <0xE3>
    1.02
    	
    0.92
        
    0.82
    0.75
         
    0.74
     leftWheel
    0.74
    					
    0.72
    		
    0.72
               
    0.71
                       
    0.71
    Act Density 3.267%

    No Known Activations