INDEX
Explanations
HTML attributes and assignment statements
<start_of_turn>user
New Auto-Interp
Negative Logits
TagMode
-0.84
SequentialGroup
-0.84
<unused74>
-0.81
<unused8>
-0.80
<unused28>
-0.80
<pad>
-0.80
<unused14>
-0.80
<unused79>
-0.80
<unused16>
-0.80
<unused41>
-0.80
POSITIVE LOGITS
"
0.62
("0.59
="
0.58
:@"
0.52
'
0.52
["
0.52
“
0.51
='
0.51
('0.51
‘
0.47
Activations Density 0.001%