INDEX

Explanations

of

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

__':

-0.59

:]:

-0.51

CWE

-0.51

__":

-0.49

 characteristic

-0.49

 signatures

-0.49

}")]

-0.49

 vertrou

-0.47

 noqa

-0.47

("")]

-0.46

POSITIVE LOGITS

AxisAlignment

0.59

the

0.55

 nuages

0.55

Jereo

0.54

 sánh

0.50

щення

0.50

 ostavi

0.49

 having

0.48

achal

0.47

picasso

0.47

Activations Density 0.000%