INDEX

Explanations

The followed by proper noun

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

en

0.91

in

0.89

IN

0.89

the

0.89

of

0.86

own

0.84

If

0.83

ο

0.82

0.81

POSITIVE LOGITS

atrical

1.71

odore

1.50

odora

1.40

 Beatles

1.39

atres

1.37

ophylline

1.33

mselves

1.32

matic

1.30

 Hague

1.30

orems

1.28

Activations Density 0.146%