INDEX

Explanations

on, eat, red, imit, wh, apple, t, 鈴, using, against

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

uelle

0.39

advant

0.36

netic

0.36

bilisi

0.36

恝

0.36

gono

0.36

 inhe

0.36

 Konink

0.36

личе

0.35

 HAVE

0.35

POSITIVE LOGITS

াস

0.36

串

0.33

ারি

0.31

VAT

0.31

`;

0.30

 Companion

0.30

 więc

0.30

 cane

0.30

 mistake

0.29

}`;

0.29

Activations Density 0.034%