INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     socalled
    1.60
     wellknown
    1.45
    -”
    1.10
    1.04
     “[
    0.99
    <unused1146>
    0.98
    (“
    0.96
    ƣ
    0.94
    0.90
    0.89
    POSITIVE LOGITS
    5.50
       
    4.55
        
    3.61
         
    3.25
          
    3.13
           
    2.79
            
    2.59
              
    2.57
             
    2.48
    <0xE3>
    2.35
    Act Density 5.508%

    No Known Activations