INDEX

Explanations

instances of politically charged statements or events

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 according

-0.07

çı

-0.06

OPS

-0.06

ÙģÙĩ

-0.06

ÐŀÐł

-0.06

Ã±a

-0.06

according

-0.06

iper

-0.06

WORD

-0.06

ellas

-0.06

POSITIVE LOGITS

esktop

0.07

/REC

0.06

andom

0.06

/my

0.06

no

0.06

 kata

0.06

urer

0.06

 rÃ¡m

0.06

yles

0.06

don

0.06

Activations Density 0.012%