INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 sane

0.51

professor

0.45

nearby

0.44

isolated

0.43

neighbor

0.41

painting

0.41

 takiego

0.41

associazione

0.40

painter

0.39

곤

0.39

POSITIVE LOGITS

 Unusual

0.50

 Contribution

0.46

pInBuffer

0.46

 Contributions

0.46

味

0.45

 Coul

0.45

 ions

0.45

 Servicio

0.44

 JANUARY

0.44

 Tamaño

0.44

Activations Density 0.000%

No Known Activations