INDEX
    Explanations

    double and single quotation marks in various contexts

    New Auto-Interp
    Negative Logits
    '));
    
    -1.15
    )";
    
    -1.09
    .";
    
    -1.05
    `,
    
    -1.04
    ]');
    -1.02
    )");
    
    -1.01
    "],
    
    -1.01
    %</
    -1.00
    %");
    -0.99
    }');
    -0.99
    POSITIVE LOGITS
    ("
    1.62
     "
    1.52
    ="
    1.45
    :@"
    1.40
    ]=="
    1.34
    1.34
    !("
    1.33
    ('
    1.28
     “
    1.22
    ["
    1.21
    Act Density 0.272%

    No Known Activations