INDEX

Explanations

Experiment

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 apakah

1.96

 WHETHER

1.96

 whether

1.95

<unused130>

1.90

<unused1248>

1.88

 QUESTIONS

1.87

 inferences

1.83

<unused1237>

1.81

<unused758>

1.80

<unused1511>

1.79

POSITIVE LOGITS

1.42

at

1.27

1.15

ap

1.09

ite

1.07

jer

1.07

 владе

1.06

iteli

1.06

it

1.04

Activations Density 0.097%