INDEX

Explanations

concepts related to theoretical frameworks in scientific literature

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

gnore

-0.08

mtx

-0.07

]=>

-0.07

Ø§ÙĦØ¯

-0.06

yg

-0.06

ÏĦÎ¶

-0.06

Sno

-0.06

xcf

-0.06

 guarante

-0.06

Contained

-0.06

POSITIVE LOGITS

 GOODMAN

0.07

ï»¿↵↵

0.07

iola

0.06

orgh

0.06

 Serge

0.06

 %↵

0.06

ÂĿ

0.06

 insider

0.06

ï»¿↵

0.06

ucu

0.06

Activations Density 0.000%