Explanations

erotic encounters

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Negative Logits

Token             Logit
" summ"           -0.10
" peoples"        -0.09
"Hob"             -0.09
" awesome"        -0.09
" repl"           -0.09
" Romantic"       -0.09
"lotte"           -0.08
"summ"            -0.08
" credentials"    -0.08

Positive Logits

Token             Logit
"éĨ´"             0.10
" strength"       0.10
"raw"             0.09
"}elseif"         0.09
" libert"         0.09
"raw"             0.09
"Bold"            0.09
"Raw"             0.09
" UNIQUE"         0.09
"çµ¡"             0.08

Activations Density: 0.016%
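
To give the density figure a sense of scale, the sketch below counts how many token positions 0.016% would correspond to. It assumes, without confirmation from the dashboard, that density means the fraction of token positions with a nonzero activation, and that it is measured over the 10,000 prompts of 128 tokens listed under Configuration. The helper `activation_density` and its `threshold` parameter are hypothetical names used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    `activations` is a list of per-prompt lists of floats (hypothetical layout).
    """
    total = 0
    active = 0
    for prompt in activations:
        for value in prompt:
            total += 1
            if value > threshold:
                active += 1
    return active / total if total else 0.0


# Dashboard configuration: 10,000 prompts, 128 tokens each.
n_prompts, tokens_per_prompt = 10_000, 128
total_tokens = n_prompts * tokens_per_prompt

# Assumption: the reported 0.016% is measured over exactly these tokens.
approx_active_tokens = round(total_tokens * 0.016 / 100)

print(total_tokens, approx_active_tokens)
```

Under those assumptions the feature would fire on roughly two hundred of the 1.28 million token positions, which is consistent with a sparse, narrowly scoped feature.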