INDEX
Explanations
mathematical symbols or expressions in the text
New Auto-Interp
Negative Logits
Strickland
-0.76
Leland
-0.75
Miy
-0.74
AsUp
-0.71
Sanderson
-0.71
Loma
-0.71
ాన
-0.71
Vic
-0.70
Vanden
-0.69
mael
-0.68
POSITIVE LOGITS
\]
2.02
</blockquote>
1.16
\]
1.05
])))
1.05
↵↵
1.04
}\]
1.03
)})
0.99
}}}}
0.98
}})
0.97
"]))
0.93
Activations Density 0.126%