INDEX

Explanations

pass

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token        Logit
" pass"      -3.44
" Pass"      -2.52
"pass"       -2.48
"Pass"       -2.17
" PASS"      -1.55
" passes"    -1.51
"PASS"       -1.39
"passes"     -1.30
" passa"     -1.27
"pas"        -1.24

Positive Logits

Token            Logit
" Houſe"         1.05
" myſelf"        1.03
" itſelf"        1.01
" raiſ"          1.00
" themſelves"    0.99
" becauſe"       0.97
" Theſe"         0.96
" Majefty"       0.96
" pleaſure"      0.96
" poffible"      0.94

Activations Density 0.018%