INDEX
Explanations
punctuation marks, specifically commas and parentheses
closing quotes and code blocks
Negative Logits
Chriftian   -0.69
Jefus       -0.56
ſur         -0.53
culte       -0.53
تقاوى       -0.52
Houſe       -0.52
Inſ         -0.52
purpoſe     -0.51
<bos>       -0.51
мәкал       -0.51
Positive Logits
`,      1.35
`)      1.05
`.      1.00
>`;     0.97
}`,     0.95
`,      0.94
`;      0.93
`);     0.92
`),     0.90
`).     0.89
Activations Density: 0.017%