Explanations

tack

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)    Logit
" Peter"        -0.60
" peter"        -0.53
"Peter"         -0.52
"struct"        -0.52
"bol"           -0.51
" mobility"     -0.50
" isolada"      -0.50
"хьтан"         -0.49
"gebob"         -0.49
" vector"       -0.49

Positive Logits

Token (quoted to show leading spaces)    Logit
" resourceCulture"    0.72
"ised"                0.63
"astéro"              0.62
"Джерела"             0.62
" ――――――――"           0.62
" pleaſure"           0.60
"󠁢"                    0.60
" themſelves"         0.59
"msgTypes"            0.59
" Theſe"              0.58

Activations Density 0.295%
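
To put the density figure in scale, the sketch below multiplies out the prompt configuration listed above and applies the 0.295% density to it. It assumes the density was measured over exactly those 16,384 prompts of 128 tokens each, which the dashboard does not state; the function name `estimated_active_tokens` is hypothetical and only illustrates the arithmetic.

```python
# Values copied from the dashboard configuration above.
PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.295


def estimated_active_tokens(prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated tokens on which the feature fires).

    Assumption: the density is the share of all dashboard tokens with a
    non-zero activation. The source does not confirm this definition.
    """
    total = prompts * tokens_per_prompt
    active = round(total * density_percent / 100)
    return total, active


total_tokens, active_tokens = estimated_active_tokens(
    PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {active_tokens:,}")
```

Under that assumption the feature would fire on roughly six thousand of about two million tokens, i.e. it is a sparse feature.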