INDEX

Explanations

overwhelmed or overwhelming

New Auto-Interp

Top Features by Cosine Similarity

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

IFn

-0.10

ç£¨

-0.09

Å¾dy

-0.09

 Jung

-0.09

agara

-0.09

andom

-0.09

ilere

-0.09

itioner

-0.09

à¸£à¸ģ

-0.09

POSITIVE LOGITS

ingly

0.24

 Kelley

0.11

tures

0.11

 amount

0.10

 senses

0.10

 majority

0.10

ture

0.10

top

0.10

 Gore

0.10

came

0.10

Activations Density 0.014%