INDEX

Explanations

instances of criticism and its emotional impact on individuals

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

):?>↵

-0.08

arz

-0.07

stup

-0.07

ÑģÑĥ

-0.07

ÅĻez

-0.07

akte

-0.07

æĸ¹éĿ¢

-0.07

elli

-0.07

 blas

-0.06

iliar

-0.06

POSITIVE LOGITS

 punitive

0.07

 Lamb

0.06

ohl

0.06

theless

0.06

 casc

0.06

 gossip

0.06

 confront

0.06

 closed

0.06

ÎļÎ±

0.05

 atmos

0.05

Activations Density 0.015%