INDEX

Explanations

example followed by punctuation

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 neuroscience

0.80

...",

0.75

ोटा

0.73

 ...,

0.71

羣

0.70

 truth

0.69

 লেখা

0.69

!",

0.69

 struggling

0.69

",

0.69

POSITIVE LOGITS

0.99

،

0.92

,-

0.92

гӀ

0.79

нибудь

0.78

,-\

0.73

ấp

0.71

влено

0.70

,(

0.69

otification

0.69

Activations Density 0.152%