INDEX

Explanations

describing specific instances

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

GameObject

0.40

 tubers

0.40

HexString

0.40

Contact

0.38

міну

0.38

Voltage

0.37

 rodziny

0.37

ຣ

0.36

リエステル

0.35

灸

0.35

POSITIVE LOGITS

simply

0.49

 clarity

0.46

that

0.45

comple

0.45

เพื่อให้

0.43

intention

0.43

 தெளி

0.42

 anlaş

0.42

don

0.41

 Simply

0.41

Activations Density 0.001%