INDEX

Explanations

phrases indicating reluctance or avoidance

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

acman

-0.08

osci

-0.08

ÑĩÐ¸Ñģ

-0.08

ertino

-0.07

ãģĿãģĹãģ¦

-0.07

yiy

-0.07

à¸£à¸ĵ

-0.07

importe

-0.07

ÐµÑĨ

-0.07

 amen

-0.07

POSITIVE LOGITS

isode

0.06

 âĢĲ

0.05

ourt

0.05

Callbacks

0.05

 jurisdiction

0.05

Pik

0.05

 repo

0.05

Activations Density 0.045%