INDEX
Explanations
occurrences of code-related elements or syntax
Negative Logits
Efq           -0.79
houſe         -0.70
Majefty       -0.70
)";           -0.70
)»            -0.69
Monfieur      -0.69
purpoſe       -0.67
••••          -0.67
]";           -0.66
themſelves    -0.65
Positive Logits
</code>       2.05
<code>        1.20
`,            1.08
</h6>         1.06
</th>         0.92
`.            0.84
`             0.82
}`            0.82
</sub>        0.81
</td>         0.77
Activations Density: 0.054%