Explanations

Introduces example/transition

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token             Logit
" Namely"         -1.04
" Theſe"          -0.92
" ProtoMessage"   -0.88
" betweenstory"   -0.85
" onCancelled"    -0.82
"parsedMessage"   -0.81
" beginnetje"     -0.80
" виправивши"     -0.77
"AccessorTable"   -0.76
"tvguidetime"     -0.76

Positive Logits

Token          Logit
" though"      0.61
"to"           0.59
"yet"          0.58
" able"        0.56
" with"        0.54
" like"        0.51
" operated"    0.51
" about"       0.50
" speaking"    0.49

Activations Density: 0.060%
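
As a rough sanity check on the density figure, the sketch below estimates how many tokens the feature would fire on. It rests on two assumptions that this page does not confirm: that activation density is the fraction of tokens with a nonzero activation, and that it was measured over the dashboard prompt set of 16,384 prompts with 128 tokens each. The variable names are illustrative only.

```python
# Assumption: density = fraction of tokens on which the feature is active,
# measured over the dashboard prompts (not confirmed by the page itself).
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.060  # reported value, already rounded

total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
expected_active_tokens = total_tokens * DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {expected_active_tokens:.0f}")
```

Because the reported density is rounded to three decimal places, the resulting count is only an approximation.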