INDEX

Explanations

references to manipulation and exploitation within societal and political contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

maz

-0.07

zn

-0.07

wick

-0.07

uitka

-0.07

ilan

-0.06

 å¸Ĥ

-0.06

alah

-0.06

undy

-0.06

Ø§Ø·ÙĦ

-0.06

xmm

-0.06

POSITIVE LOGITS

Ìģ

0.06

-C

0.06

.setViewport

0.06

 satellites

0.05

haven

0.05

abad

0.05

ØŃØ±

0.05

Ald

0.05

soever

0.05

Arb

0.05

Activations Density 0.130%