INDEX

Explanations

references to issues related to societal norms and justice narratives

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Wikipedia

-0.08

âĢ¦↵

-0.08

âĢ¦

-0.06

byt

-0.06

âĢ¦.

-0.06

land

-0.06

 among

-0.05

 âĢ¦↵

-0.05

Ì

-0.05

 wikipedia

-0.05

POSITIVE LOGITS

ëĮ

0.09

riba

0.08

Ã¨m

0.07

 Ø§ÙĦØ±ÙħØ²ÙĬØ©

0.07

OptionsMenu

0.07

/*č↵

0.07

Äįel

0.07

à¸§à¸¥

0.07

ÃŃÅ¡

0.07

.gc

0.07

Activations Density 0.065%