INDEX

Explanations

key terms and concepts associated with communication and understanding

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

fte

-0.07

 Heller

-0.06

bra

-0.06

ê³¼ìĿĺ

-0.06

cko

-0.06

ÐµÑĦ

-0.06

lier

-0.06

lein

-0.06

rst

-0.06

upe

-0.06

POSITIVE LOGITS

_ASSUME

0.07

 identified

0.07

.sessions

0.07

èĥĨ

0.07

 boobs

0.07

identified

0.07

é¼ĵ

0.07

-Identifier

0.07

kills

0.06

 effective

0.06

Activations Density 0.000%