INDEX
Explanations
mathematical formulas or expressions, specifically those with variable definitions and conditions
New Auto-Interp
Negative Logits
Monfieur
-1.13
Efq
-1.07
myſelf
-1.00
itſelf
-0.92
himſelf
-0.90
Majefty
-0.89
becauſe
-0.89
Jefus
-0.87
Houſe
-0.86
themſelves
-0.86
POSITIVE LOGITS
↵↵
0.83
\]
0.80
}}}$
0.71
}$
0.71
})$,
0.67
↵
0.67
])));
0.67
]));
0.65
})$
0.65
</blockquote>
0.64
Activations Density 0.368%