Explanations

color-related elements or attributes in the text

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token       Logit
"itters"    -0.08
"lef"       -0.08
"zon"       -0.07
"adena"     -0.07
"ERSHEY"    -0.06
"ISCO"      -0.06
"weit"      -0.06
" зави"     -0.06
"abez"      -0.06
"udden"     -0.06

Positive Logits

Token       Logit
"edio"      0.07
"iva"       0.06
"くん"      0.06
" kicker"   0.06
"ary"       0.06
"Mag"       0.06
"red"       0.06
" gold"     0.06
"색"        0.06
" recon"    0.06

Activations Density 0.004%
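
As a rough sanity check, the configuration figures can be combined with the density value. The sketch below assumes that "Activations Density" means the fraction of dashboard tokens on which the feature is active, which this page does not state; the variable names are illustrative.

```python
# Figures taken from the Configuration section of this page.
num_prompts = 24_576
tokens_per_prompt = 128

# Reported density, converted from a percentage to a fraction.
density = 0.004 / 100

# Total number of tokens the dashboard was computed over.
total_tokens = num_prompts * tokens_per_prompt

# Assumption: density is the share of tokens with a nonzero activation.
expected_active_tokens = total_tokens * density

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {expected_active_tokens:.0f}")
```

Under that reading, the feature fires on roughly a hundred or so tokens out of about three million, so it is very sparse.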