    Explanations

    occurrences of brackets, indicating structured data formats or code

    Negative Logits
    Token        Logit
    BibitemShut  -0.95
    ')}}         -0.94
    ]})          -0.92
    ))))))))     -0.91
    )");↵        -0.90
    '):↵         -0.89
    ']?>         -0.87
    "]];         -0.86
    ')))         -0.85
    '))↵         -0.85
    (↵ marks a trailing line break in the token)
    Positive Logits
    Token  Logit
    [      3.82
    ␣[     2.24
    ()[    2.03
    [\     2.00
    )[     1.97
    .[     1.92
    [(     1.83
    }[     1.83
    ␣$[    1.79
    [$     1.79
    (␣ marks a leading space in the token)
    Act Density 0.435%
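
The dashboard does not say how the density figure is computed. A common reading, assumed here, is the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading on toy data; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions with a
    non-zero (above-threshold) feature activation.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (not from the dashboard): 1000 positions, 4 of them active.
acts = [0.0] * 1000
for i in (3, 250, 600, 999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.435% would mean the feature is active on roughly 4 to 5 of every 1000 sampled tokens.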

    No Known Activations