INDEX

Explanations

phrases signaling contradictions or surprises in context

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

lei

-0.06

Ã¡la

-0.06

idar

-0.06

apur

-0.06

Ø§Ø¯

-0.06

ua

-0.06

ÙĬØ¯

-0.06

erg

-0.06

 Meal

-0.06

POSITIVE LOGITS

Ä±klÄ±

0.07

ertype

0.07

aunch

0.06

Rendering

0.06

 damp

0.06

 dipl

0.06

 Mint

0.06

ëłĩ

0.06

Ã¥n

0.06

dre

0.06

Activations Density 0.120%