INDEX
    Explanations

    patterns related to mathematical or logical expressions, particularly those involving parentheses and brackets

    New Auto-Interp
    Negative Logits
    (
    -1.46
     (
    -1.27
    [
    -1.25
    _
    -1.10
    1
    -1.10
     [
    -1.07
    -
    -1.05
    2
    -1.04
    '
    -1.04
    n
    -1.01
    POSITIVE LOGITS
    "])
    
    4.06
    ")));
    
    4.06
    ']))
    
    3.95
    )");
    
    3.83
    "]);
    
    3.81
    })$}
    3.81
    ')")
    3.79
    .)}
    3.78
    "]));
    3.78
    '))
    
    3.69
    Act Density 0.535%

    No Known Activations