INDEX
    Explanations

    occurrences of code-related elements or syntax

    New Auto-Interp
    Negative Logits
     Efq
    -0.79
     houſe
    -0.70
     Majefty
    -0.70
    )";
    
    -0.70
    -0.69
     Monfieur
    -0.69
     purpoſe
    -0.67
    ••••
    -0.67
    ]";
    -0.66
     themſelves
    -0.65
    POSITIVE LOGITS
    </code>
    2.05
    <code>
    1.20
    `,
    1.08
    </h6>
    1.06
    </th>
    0.92
    `.
    0.84
    `
    0.82
    }`
    0.82
    </sub>
    0.81
    </td>
    0.77
    Act Density 0.054%

    No Known Activations