INDEX

Explanations

phrases indicating positive or uplifting information

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

bserv

-0.08

erb

-0.08

Ð»ÑĥÐ³

-0.07

Ð²Ð¸Ñī

-0.07

ayment

-0.07

UGH

-0.07

icens

-0.07

 WHETHER

-0.07

ewn

-0.07

ãĢ

-0.06

POSITIVE LOGITS

otto

0.08

 hereby

0.06

beits

0.06

uzu

0.06

 Trend

0.06

=""/>↵

0.06

 æ²

0.06

 unlike

0.06

andi

0.06

 finally

0.06

Activations Density 0.016%