INDEX

Explanations

terms indicating deep significance or importance

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ADE

-0.08

TING

-0.07

term

-0.07

DOG

-0.07

tings

-0.07

asso

-0.07

LEAN

-0.07

-0.06

BOARD

-0.06

 stray

-0.06

POSITIVE LOGITS

ly

0.09

 depths

0.09

/ext

0.08

antly

0.07

 deeply

0.07

ÑģÑĮ

0.07

ively

0.07

 Depths

0.07

est

0.07

Ã©ment

0.07

Activations Density 0.002%