gardens or experiments

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Gardens

-1.25

 nahilalakip

-1.20

 experiments

-1.12

Experiments

-1.02

 Experiments

-0.99

 gardens

-0.99

experiments

-0.96

 GARD

-0.93

 betweenstory

-0.93

 EXPERIMENTS

-0.91

POSITIVE LOGITS

0.69

0.50

↵↵

0.43

ی

0.42

 about

0.41

0.39

at

0.39

0.37

0.36

Activations Density 0.147%