INDEX

Explanations

activation of key consequences

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 آواز

0.43

 einzigen

0.43

 izol

0.43

 inconvén

0.43

 advantage

0.42

貳

0.41

灘

0.41

 вертика

0.41

ocsát

0.41

驄

0.40

POSITIVE LOGITS

YL

0.40

 renewed

0.39

utu

0.39

 wholesome

0.39

ELI

0.38

ን

0.38

 Community

0.38

complete

0.38

 shared

0.37

 repeats

0.37

Activations Density 0.859%