INDEX
    Explanations

    special characters or formats in code snippets

    parentheses and brackets

    Negative Logits
    (\n marks a trailing line break in the token; ␣ marks a leading space)
    Token        Logit
    ']))\n       -0.74
    ')));        -0.72
    ")));\n      -0.70
    ]))\n        -0.69
    ")))         -0.67
    ')))         -0.67
    '))\n        -0.66
    ']);\n       -0.66
    ")));        -0.66
    ]')          -0.65

    Positive Logits
    Token        Logit
    (".",        1.20
    ("/",        1.14
    ([],         1.12
    (_,          1.08
    (',',        1.05
    ('.',        1.05
    ("",         1.05
    (",",        0.99
    ␣(_,         0.96
    ('',         0.94

    Act Density 0.004%

    No Known Activations
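
The "parentheses and brackets" explanation can be checked against the listed tokens with a short sketch. The token strings are copied from the two logit lists; reading a blank line after a token as a trailing newline is an assumption about the page dump, and the helper names `closes_call` and `opens_first_argument` are hypothetical, introduced only for this illustration.

```python
# Top logit tokens from the lists above. "\n" stands for a trailing line
# break, which is an assumption about how the page was dumped.
NEGATIVE = ["']))\n", "')));", '")));\n', "]))\n", '")))',
            "')))", "'))\n", "']);\n", '")));', "]')"]
POSITIVE = ['(".",', '("/",', "([],", "(_,", "(',',",
            "('.',", '("",', '(",",', " (_,", "('',"]


def closes_call(token: str) -> bool:
    """True if the token has a closing bracket and no opening bracket."""
    has_close = any(c in token for c in ")]")
    has_open = any(c in token for c in "([")
    return has_close and not has_open


def opens_first_argument(token: str) -> bool:
    """True if the token opens a parenthesis and ends right after a comma."""
    stripped = token.strip()
    return stripped.startswith("(") and stripped.endswith(",")


if __name__ == "__main__":
    print("negative tokens that only close brackets:",
          sum(closes_call(t) for t in NEGATIVE), "of", len(NEGATIVE))
    print("positive tokens that open a first argument:",
          sum(opens_first_argument(t) for t in POSITIVE), "of", len(POSITIVE))
```

Under these assumptions, every suppressed token only closes brackets, while every promoted token opens a parenthesis and stops just after the first argument's comma.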