INDEX

Explanations

hidden instructions and unnecessary repetition

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 tinc

0.51

栦

0.51

}}=

0.49

幵

0.48

প

0.46

or

0.46

僟

0.45



0.45

ilte

0.44

भ

0.43

POSITIVE LOGITS

 strengths

0.50

 quintessential

0.49

 pivotal

0.45

 percol

0.45

0.44

 capabilities

0.43

 competencies

0.43

 мощность

0.43

 permeates

0.43

 инфраструк

0.42

Activations Density 0.001%