INDEX

Explanations

phrases indicating sincerity or seriousness

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

gamber

-0.07

ocks

-0.07

gne

-0.07

emory

-0.07

htdocs

-0.06

alled

-0.06

 ÐĴÐ¸Ð´

-0.06

Ð°Ð»Ð¾Ð²

-0.06

LObject

-0.06

culos

-0.06

POSITIVE LOGITS

uity

0.07

clist

0.06

McK

0.06

.Arguments

0.06

ë£Į

0.06

osity

0.06

ently

0.06

 suburb

0.06

unal

0.06

Activations Density 0.001%