INDEX
Explanations
function call patterns in the code
Negative Logits
eben
-0.60
دانشنامهٔ
-0.57
med
-0.55
&
-0.54
ناف
-0.54
端
-0.54
ایج
-0.53
/#{
-0.53
kori
-0.53
والب
-0.53
Positive Logits
()
2.12
(()
1.31
(),
1.27
()
1.20
())
1.15
().
1.12
();
1.12
():
1.07
={()
1.02
()=>{
0.99
Activations Density 0.050%