INDEX

Explanations

simulate hypothetical actions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

<strong>

0.64

扛

0.45

<h5>

0.44

বর্তমানে

0.43

<em>

0.42

Pressure

0.41

 Kardashians

0.41

Commercial

0.41

 Dahmer

0.41

Businesses

0.40

POSITIVE LOGITS

</b>

0.48

 goto

0.42

 amino

0.41

 ferrugineux

0.41

pep

0.41

 unambiguously

0.40

 moneys

0.40

 histidine

0.40

trp

0.39

vc

0.39

Activations Density 0.000%