INDEX

Explanations

expressions related to making decisions or choices

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 track

-0.06

 radi

-0.06

 butt

-0.06

ãĥ¼ãĥ

-0.06

strtolower

-0.05

ç¥ŀ

-0.05

 acronym

-0.05

ÅĻ

-0.05

ly

-0.05

osc

-0.05

POSITIVE LOGITS

etine

0.09

éĺ¶

0.07

elper

0.07

GenerationStrategy

0.07

 scand

0.07

icontrol

0.07

andes

0.07

racak

0.07

ediator

0.07

Ao

0.07

Activations Density 0.001%