INDEX

Explanations

themes related to censorship and the suppression of public expression

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

atsu

-0.07

ÑĢÐ°ÑģÑĤ

-0.07

à¸£à¸ĵ

-0.07

á»Ļ

-0.07

á»§

-0.07

ĻĤ

-0.06

 Ãºn

-0.06

äºĽ

-0.06

isch

-0.06

Ã¡o

-0.06

POSITIVE LOGITS

 perfectly

0.09

 nowhere

0.08

God

0.06

Ħĸ

0.06

Leg

0.06

 legitimate

0.06

God

0.06

 repeatedly

0.06

ebra

0.06

Activations Density 0.038%