Explanations
`lt` and `less than` comparisons
Negative Logits
眄          0.38
ânt         0.37
adamia      0.37
idescent    0.36
TestCase    0.35
â           0.35
âl          0.35
az          0.35
Yet         0.35
AST         0.34
Positive Logits
1.52
1.27
1.07
0.88
0.83
0.75
0.65
0.64
0.64
0.57
Activations Density 0.015%