INDEX

Explanations

phrases indicating positive or good news

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ãĥ¼ãĥĢ

-0.07

cum

-0.06

IDb

-0.06

 Bernstein

-0.06

/Images

-0.06

ved

-0.06

Gem

-0.06

_ISS

-0.06

_predicted

-0.06

blo

-0.06

POSITIVE LOGITS

abyrin

0.07

 news

0.07

rella

0.07

amedi

0.07

ucc

0.07

news

0.07

uent

0.06

gregator

0.06

ÃľM

0.06

usan

0.06

Activations Density 0.004%