INDEX

Explanations

causes disruption and change

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Depends

0.48

 important

0.46

 where

0.46

 dreamy

0.45

 someone

0.45

 your

0.44

。

0.44

0.43

 when

0.43

 bijzonder

0.43

POSITIVE LOGITS

 menyebabkan

0.64

 prevents

0.63

ทำให้

0.62

 ทำให้

0.62

导致

0.61

 precludes

0.61

導致

0.61

导致的

0.58

 impairs

0.57

 induces

0.57

Activations Density 0.016%