Explanations

dust, mud, ashes, smoke

Configuration

Prompts (Dashboard): 10,000 prompts, 128 tokens each

Dataset (Dashboard): lmsys/lmsys-chat-1m

Negative Logits

Token         Logit
"weise"       -0.11
"(s"          -0.10
"oise"        -0.10
" McKay"      -0.10
"urf"         -0.09
"imson"       -0.09
"ondere"      -0.09
"iac"         -0.09
" waters"     -0.09
"akening"     -0.08

Positive Logits

Token         Logit
"iness"       0.15
" pocket"     0.11
"-covered"    0.11
" plug"       0.10
"-filled"     0.10
" Cutter"     0.10
"INESS"       0.10
"bin"         0.10
"ey"          0.09
"Pocket"      0.09

Activations Density: 0.091%
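
To give the density figure a concrete scale, the sketch below converts it into an approximate token count. It assumes, and the page does not state this, that the density is the fraction of dashboard tokens on which the feature has a nonzero activation, measured over the 10,000 prompts of 128 tokens each listed under Configuration. The function and constant names are hypothetical.

```python
# Assumption: activation density = fraction of dashboard tokens on which
# the feature fires (nonzero activation). The page does not confirm this.
NUM_PROMPTS = 10_000        # from the dashboard configuration
TOKENS_PER_PROMPT = 128     # from the dashboard configuration
DENSITY_PERCENT = 0.091     # reported activation density, in percent


def estimated_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated number of tokens where the feature fires)."""
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_percent / 100.0
    return total_tokens, active_tokens


total, active = estimated_active_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT)
print(f"{total:,} tokens in total, roughly {active:,.0f} with a nonzero activation")
```

Under that assumption the feature would fire on about 1,165 of the 1,280,000 dashboard tokens, so it is sparse but not vanishingly rare.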