Explanations

loyal servant, object, example

Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m

Negative Logits

"bil"      -0.11
"Ply"      -0.09
"auce"     -0.09
" rept"    -0.09
" nÄĽho"   -0.09
"aru"      -0.08
" è´"      -0.08
" bilm"    -0.08
" Tavern"  -0.08
"QUI"      -0.08

Positive Logits

"Car"          0.10
" example"     0.10
" Example"     0.10
" basketball"  0.09
"ofs"          0.09
" Burk"        0.09
"car"          0.09
".apple"       0.09
" Apple"       0.09

Activations Density 0.749%