INDEX

Explanations

terms related to disagreements or dissents

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ìĨĶ

-0.08

ussen

-0.07

iative

-0.07

ÏĥÎ¼Î±ÏĦÎ±

-0.07

BOSE

-0.07

erk

-0.07

erb

-0.07

 tá»Ń

-0.07

inand

-0.07

aggi

-0.07

POSITIVE LOGITS

ively

0.11

ivity

0.10

ors

0.09

ive

0.09

ection

0.08

 stren

0.08

 raised

0.08

able

0.08

0.07

ives

0.07

Activations Density 0.004%