INDEX
    Explanations

    variable names or identifiers in code

    Negative Logits
    < 
    -0.72
     defaultdict
    -0.71
     Diſ
    -0.70
    */\n\n
    -0.69
     Reſ
    -0.69
    }*/
    -0.67
     ſeveral
    -0.65
     ſee
    -0.65
     ſtand
    -0.65
    ']").
    -0.65
    Positive Logits
     x
    1.93
     X
    1.68
    x
    1.64
    getX
    1.44
     getX
    1.41
    X
    1.40
    setX
    1.26
     xanth
    1.20
    xas
    1.17
     Xavier
    1.15
    Activation Density: 0.394%

    No Known Activations