INDEX

Explanations

references to family relationships and dynamics

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ÅĤÄħ

-0.09

orst

-0.08

ALSE

-0.08

.qual

-0.08

ÙĤØ¨

-0.08

uron

-0.07

znÃ¡m

-0.07

stÃ¼t

-0.07

 dÅĻÃŃ

-0.07

uts

-0.07

POSITIVE LOGITS

pre

0.07

 ages

0.07

 blind

0.07

Inf

0.07

 late

0.06

 mini

0.06

age

0.06

 newly

0.06

 younger

0.06

 plan

0.06

Activations Density 0.005%