INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

\")

0.65

\...

0.56

\";

0.53

\"

0.51

\"]

0.51

\",

0.50

\*

0.50

\)

0.49

\@

0.49

POSITIVE LOGITS

 Firstly

0.45

 Fortunately

0.44

 Here

0.44

 Unfortunately

0.44

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.43

 বিস্মিত

0.43

 Dopo

0.43

 Depending

0.43

 Nevertheless

0.42

 Після

0.42

Activations Density 3.596%

No Known Activations