references to blog posts or episodes

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

chio

-0.08

rech

-0.06

ing

-0.06

Ã¤m

-0.06

lein

-0.06

_ordered

-0.06

POSITIVE LOGITS

ONTAL

0.08

Untitled

0.08

(éĩĳ

0.07

awy

0.07

theid

0.07

ÙĥÙĬÙĬÙģ

0.07

Ð±ÑĢÑı

0.07

 Aires

0.07

XHR

0.07

 ++)↵

0.07

Activations Density 0.002%