INDEX

Explanations

describing or specifying information

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 approbation

0.42

晚

0.40

髖

0.39

 বিকালে

0.39

छे

0.38

 algebraica

0.38

 trifling

0.38

股

0.38

 Highlights

0.37

牛肉

0.37

POSITIVE LOGITS

asında

0.44

 vecchia

0.43

 Gottlieb

0.43

 reflex

0.42

 vecchio

0.42

 Reflex

0.41

 சை

0.40

reflex

0.40

vors

0.40

eradish

0.40

Activations Density 0.067%