INDEX

Explanations

ability

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 CreateTagHelper

-0.71

 feroit

-0.63

featureID

-0.61

 réaliste

-0.61

 odeur

-0.60

 eventuali

-0.60

 étoit

-0.59

and

-0.59

 morire

-0.59

 ifrån

-0.57

POSITIVE LOGITS

NOPQRST

0.58

 convince

0.55

 createState

0.54

 Provided

0.53

 somehow

0.53

bkz

0.52

 kasarigan

0.52

 gyrus

0.50

omorphisms

0.50

elts

0.49

Activations Density 0.001%