INDEX

Explanations

phrases indicating a positive reaction or approval

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

aid

-0.06

oppable

-0.06

illis

-0.06

weetalert

-0.06

 tongue

-0.06

uffy

-0.06

ëł

-0.05

 Eclipse

-0.05

æĹıèĩªæ²»

-0.05

Î¹Î¿ÏĤ

-0.05

POSITIVE LOGITS

 much

0.07

Æ°á»Ľ

0.07

.bo

0.06

áº¹n

0.06

oha

0.06

uco

0.06

 cries

0.06

 mutual

0.06

 great

0.06

 trÃº

0.06

Activations Density 0.021%