INDEX
Explanations
variable declarations and assignments in code
New Auto-Interp
Negative Logits
<eos>
-0.32
élé
-0.30
pleaſure
-0.29
caoutchouc
-0.28
vectorielles
-0.27
chrétiens
-0.27
ennemi
-0.25
onlyOwner
-0.25
souverain
-0.25
vôtre
-0.25
POSITIVE LOGITS
パンチラ
0.89
<unused20>
0.88
<pad>
0.87
[@BOS@]
0.87
<unused43>
0.87
⏔
0.87
<unused28>
0.87
<unused23>
0.87
<unused3>
0.87
<unused14>
0.87
Activations Density 0.013%