INDEX

Explanations

`.` `.)` `/)` characters

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

</th>

1.10

֔

1.00

</h1>

0.95

</h4>

0.94

<unused497>

0.92



0.90

𐰚

0.87

𓇼

0.85

鏘

0.84

<unused2124>

0.84

POSITIVE LOGITS

<eos>

3.02

៕

1.56

↵↵

1.29

↵

1.00

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.88

Conclusion

0.88

<unused61>

0.87

↵↵↵↵↵

0.86

↵↵↵

0.86

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.84

Activations Density 0.600%