Explanations

topics related to authority and societal challenges

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token          Logit
"roys"         -0.08
".SetToolTip"  -0.07
"oy"           -0.07
"atel"         -0.07
"antis"        -0.07
"egend"        -0.07
"olicit"       -0.07
"ково"         -0.07
"ilda"         -0.07
"ilde"         -0.07

Positive Logits

Token            Logit
" increasingly"  0.10
"åį´"            0.08
" becomes"       0.08
" become"        0.08
" wonder"        0.08
"åį»"            0.07
" Wonder"        0.07
"focus"          0.07
" wondered"      0.06
" increasing"    0.06

Activations Density 0.061%
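
To put the density figure in context, the following is a minimal sketch of the arithmetic. It assumes that the activation density is the share of dashboard prompt tokens on which the feature fires, which the page does not state explicitly. The function names are hypothetical and exist only for illustration.

```python
# Figures copied from the dashboard configuration above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.061


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of tokens across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> int:
    """Rough count of tokens with a nonzero activation.

    Assumption: density is the fraction of all prompt tokens on which
    the feature is active. The dashboard does not confirm this.
    """
    return round(total * density_percent / 100)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
print(f"Total tokens: {total:,}")
print(f"Estimated active tokens: {active:,}")
```

Under that assumption, the feature would be active on roughly two thousand of about three million tokens, which is consistent with a sparse feature.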