INDEX

Explanations

references to tragic events involving loss of life and community impact

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Prison

-0.07

 jail

-0.07

 prisoner

-0.07

ep

-0.06

åĽ

-0.06

afone

-0.06

Ã¤Ã¤n

-0.06

unc

-0.06

 Composite

-0.06

 accidents

-0.06

POSITIVE LOGITS

 innocent

0.07

 Innoc

0.07

ç«ĭãģ¦

0.07

 innoc

0.06

 targeted

0.06

Å¡it

0.06

 Dort

0.06

arget

0.06

targets

0.06

%"

0.06

Activations Density 0.008%