INDEX
    Explanations

    punctuation and criteria used to indicate data or results in a structured format

    New Auto-Interp
    Negative Logits
    );
    
    -0.73
    "]))
    -0.69
    "])
    
    -0.68
    ");
    
    -0.67
    '];
    
    -0.66
    ']))
    -0.66
    "];
    
    -0.65
    '])
    
    -0.64
    ());
    
    -0.63
    "])
    -0.61
    POSITIVE LOGITS
    ;
    0.98
    matchCondition
    0.73
    +;
    0.64
     {;
    0.61
     ;
    0.60
    ;;;;
    0.59
    °;
    0.58
    ;;;
    0.57
    *;
    0.57
    %;
    0.57
    Act Density 0.440%

    No Known Activations