INDEX
Explanations
repeated sequences of equal signs, likely for formatting or structural emphasis in code
New Auto-Interp
Negative Logits
*********
-0.99
*************
-0.96
*********/
-0.95
************
-0.94
***********
-0.94
********
-0.90
**********
-0.88
{*}-0.88
}^{*}-0.88
***************
-0.87
POSITIVE LOGITS
================
2.19
————————————————
1.05
----------------
1.04
~~~~~~~~~~~~~~~~
0.86
################
0.86
————————
0.84
________________
0.83
................
0.83
▬▬▬▬▬▬▬▬
0.78
qu
0.78
Activations Density 0.215%