
Explanations

video that, have the



Configuration

Prompts (Dashboard)

10,000 prompts, 128 tokens each

Dataset (Dashboard)

lmsys/lmsys-chat-1m


Negative Logits

Token             Logit
" ｰ"              -0.11
"_Tis"            -0.10
" металли"        -0.10
"togroup"         -0.09
"isContained"     -0.09
" fitte"          -0.09
"chandle"         -0.09
" appearances"    -0.09
"_Lean"           -0.09
"----------</"    -0.09

Positive Logits

Token             Logit
" task"           0.20
" topic"          0.19
" project"        0.16
" scenario"       0.16
" story"          0.16
" event"          0.15
"任务"            0.15
"题"              0.15
"task"            0.14
" episode"        0.14

Activations Density 1.885%
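
To put the density figure in context, the short sketch below combines it with the dashboard configuration (10,000 prompts of 128 tokens each). It rests on two assumptions that the page itself does not state: that the density was measured over this same prompt set, and that "density" means the fraction of tokens on which the feature has a non-zero activation. The function name is hypothetical and used only for illustration.

```python
# Figures copied from the dashboard configuration and density line.
NUM_PROMPTS = 10_000
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.01885  # 1.885%

def estimated_active_tokens(num_prompts, tokens_per_prompt, density):
    """Return (total tokens, estimated tokens with non-zero activation).

    Assumption: density is the fraction of all dashboard tokens
    on which the feature fires at all.
    """
    total = num_prompts * tokens_per_prompt
    return total, round(total * density)

total_tokens, active_tokens = estimated_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY
)
print(f"{total_tokens:,} tokens in total, roughly {active_tokens:,} active")
```

Under those assumptions the feature would be active on roughly one token in every 53.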