INDEX

Explanations

references to actions of dropping or leaving things behind

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

abr

-0.06

ĳ¸

-0.06

uries

-0.06

leness

-0.06

 Auch

-0.06

 success

-0.06

uish

-0.06

zin

-0.05

xious

-0.05

(clock

-0.05

POSITIVE LOGITS

-drop

0.11

 dropped

0.11

 dropping

0.11

(drop

0.10

 drops

0.10

 drop

0.10

.drop

0.10

 Drop

0.10

 onto

0.10

DROP

0.10

Activations Density 0.021%