INDEX

Explanations

phrases related to the occurrence or significance of actions and their effects

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

TAS

-0.06

operand

-0.06

 tion

-0.06

bon

-0.05

 Ã¢

-0.05

_ELEM

-0.05

-widgets

-0.05

bon

-0.05

--

-0.05

POSITIVE LOGITS

Ìģ

0.17

ÌĨ

0.15

Ì

0.12

ÌĢ

0.12

ÃĮ

0.11

Ì§

0.10

ÌĪ

0.10

Ìĥ

0.09

Ìģt

0.09

ãģĵãģĿ

0.09

Activations Density 0.404%