INDEX

Explanations

possessive `'s` followed by nouns

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

the

0.33

The

0.29

the

0.28

If

0.24

ulates

0.24

</h2>

0.23

eteries

0.22

:"

0.22

Both

0.22

POSITIVE LOGITS

own

0.40

 eigene

0.26

 собственные

0.25

own

0.25

 propia

0.24

 kendi

0.23

 raincoat

0.23

 prerogative

0.23

 proverbial

0.23

 foray

0.23

Activations Density 0.077%