INDEX
Explanations
closing brackets in code syntax
closing code blocks
New Auto-Interp
Negative Logits
ſſung
-1.06
ロウィン
-1.04
<pad>
-1.02
<unused8>
-1.02
<unused41>
-1.02
<unused47>
-1.02
<unused28>
-1.02
<unused17>
-1.02
<unused14>
-1.02
<unused16>
-1.02
POSITIVE LOGITS
});
0.88
})));
0.70
}});
0.70
});
0.65
));
0.63
}));
0.60
}];
0.59
}]);
0.59
};
0.59
))));
0.59
Activations Density 0.009%