INDEX

Explanations

manual

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

agas

-0.26

 Parkway

-0.24

 Sydney

-0.24

obra

-0.24

.hits

-0.24

gx

-0.24

åħ¶

-0.24

æľīä½ķ

-0.24

å¼¹

-0.24

ç³»

-0.23

POSITIVE LOGITS

dia

0.31

 dressing

0.29

tons

0.28

isko

0.28

ize

0.28

æīĵæī®

0.28

ised

0.28

istic

0.27

methods

0.27

 methods

0.27

Activations Density 0.045%