INDEX

Explanations

hackers and likely outcomes

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

 insecticide  0.48
 Stap  0.47
 prognosis  0.47
 staple  0.46
 Willie  0.46
항  0.45
 Shark  0.45
 range  0.44
 hurdle  0.44
 tuna  0.44

Positive Logits

ني  0.62
ر  0.55
omer  0.51
 regretted  0.50
 slecht  0.50
اين  0.50
atoren  0.49
erver  0.49
 accuses  0.49
átiles  0.49

Activations Density 0.000%