asking why or understanding problems

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 road

0.73

 hostage

0.66

0.65

nan

0.65

//

0.64

0.63

0.62

0.61

POSITIVE LOGITS

Understanding

1.15

Choosing

1.15

Finding

1.13

Insights

1.12

Selecting

1.06

Reasons

1.03

Characteristics

1.02

Advantages

1.01

Problems

1.01

Why

1.00

Activations Density 0.000%