INDEX

Explanations

phrases indicating self-referential statements or personal reflections

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

itten

-0.09

-0.07

igm

-0.06

berries

-0.06

arios

-0.06

 anthrop

-0.06

 antic

-0.06

arius

-0.06

ighton

-0.05

ddy

-0.05

POSITIVE LOGITS

asking

0.09

HUD

0.08

 Heard

0.07

ê¶ģ

0.07

Segue

0.07

ç¨¿

0.07

 questions

0.06

 interest

0.06

presso

0.06

mant

0.06

Activations Density 0.021%