INDEX

Explanations

that introduces a descriptive clause

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 There

0.21

都

0.20

 creando

0.18

}$.

0.18

 یعنی

0.18

เลือก

0.18

There

0.18

Holder

0.17

 الذين

0.17

'.$

0.17

POSITIVE LOGITS

 hasn

0.29

 wasn

0.29

 resembles

0.29

isn

0.28

 nonetheless

0.28

 nevertheless

0.26

 differs

0.26

 hopefully

0.26

 operates

0.26

we

0.25

Activations Density 0.240%