positive descriptions and outcomes

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 السياس

-1.66

ଵ

-1.47

 бампер

-1.44

 kolejny

-1.33

 sympathique

-1.30

 Störungen

-1.27

 kawaida

-1.27

olución

-1.25

幸好

-1.25

 Dichtung

-1.23

POSITIVE LOGITS

has

1.38

1.37

my

1.31

//});

1.09

1.07

 Азии

1.06

Jang

1.05

Ita

1.03

1.02

Activations Density 0.022%