INDEX

Explanations

actions related to authority, control, and interactions involving compliance or discipline

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

leÅŁik

-0.07

UME

-0.07

TERS

-0.07

der

-0.07

anton

-0.07

enu

-0.06

jed

-0.06

dera

-0.06

anders

-0.06

ch

-0.06

POSITIVE LOGITS

him

0.10

us

0.09

 people

0.09

 them

0.08

 someone

0.08

//{{

0.08

 somebody

0.08

 anybody

0.07

 anyone

0.07

 individuals

0.07

Activations Density 0.052%