INDEX
    Explanations

    LaTeX formatting elements and structures used in mathematical expressions

    Negative Logits
     FetchType
    -0.80
    mente
    -0.71
    orsz
    -0.67
    Wię
    -0.64
     UIButton
    -0.63
     ANIM
    -0.63
    ázaro
    -0.62
     nav
    -0.61
    Παραπομπές
    -0.61
     阅读
    -0.60
    Positive Logits
    1.11
    ↵↵
    0.91
        
    0.90
    }{*}{
    0.90
    	
    0.89
          
    0.84
            
    0.83
    		
    0.83
    [toxicity=0]
    0.82
         
    0.82
    Activation Density: 0.020%

    No Known Activations