Explanations

phrases indicating the experience of overcoming challenges or difficulties

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token       Logit
"mrt"       -0.07
"redo"      -0.07
"ismet"     -0.07
"unta"      -0.07
"دا"        -0.06
"adm"       -0.06
"отор"      -0.06
"ptic"      -0.06
"ouz"       -0.06
"_PROF"     -0.06

Positive Logits

Token             Logit
"eras"            0.07
"ards"            0.07
"ARDS"            0.06
" Eisen"          0.06
"pell"            0.06
" adolescence"    0.06
"iв"              0.06
"ess"             0.06
"ible"            0.05

Activations Density 0.009%
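
To put the density figure in context, here is a rough back-of-the-envelope sketch using the prompt count and prompt length listed under Configuration. It assumes that "Activations Density" means the share of token positions on which the feature has a nonzero activation; the page itself does not define the term, so treat the result as an estimate only.

```python
# Figures copied from the dashboard.
num_prompts = 24_576
tokens_per_prompt = 128
density_percent = 0.009  # "Activations Density 0.009%"

# Total token positions in the dashboard prompt set.
total_tokens = num_prompts * tokens_per_prompt

# Assumption: density = fraction of token positions with a nonzero activation.
approx_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {round(approx_active_tokens)}")
```

Because the density is displayed to a single significant digit, the true count could plausibly differ by a few dozen tokens in either direction.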