INDEX

Explanations

concepts related to trial and error learning

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

sher

-0.07

itat

-0.07

onica

-0.06

ilir

-0.06

ÑĮÐµÑĢ

-0.06

asca

-0.06

IMIZE

-0.06

.lesson

-0.06

Ã¢u

-0.06

orest

-0.06

POSITIVE LOGITS

 alone

0.11

 Alone

0.11

 rather

0.10

alone

0.08

rather

0.08

-Based

0.08

-alone

0.07

-based

0.07

 Rather

0.07

Activations Density 0.039%