INDEX

Explanations

how questions and methods

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

}${

0.57

diverse

0.55

ziła

0.52

diversity

0.52

embeddings

0.50

satisf

0.49

jeron

0.48

pita

0.47

}$)

0.47

Combining

0.47

POSITIVE LOGITS

 cooker

0.50

 SHOP

0.50

 shop

0.49

SHOP

0.49

 hotline

0.46

ON

0.45

 manager

0.45

 protector

0.45

 grocery

0.44

 PROBLEM

0.44

Activations Density 0.000%