Explanations

inflicting pain or suffering

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

"Theorem": 0.49
"data": 0.47
"leukin": 0.46
" Theorem": 0.46
"菫": 0.46
"なども": 0.45
"ed": 0.45
" reluctance": 0.44
"zip": 0.42
"ergic": 0.42

Positive Logits

" róż": 0.44
" різ": 0.44
" кожного": 0.42
"目的是": 0.42
" głów": 0.42
" цього": 0.42
" администра": 0.41
" этого": 0.41
"왜": 0.41
" chyba": 0.40

Activations Density: 0.005%
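
To put the density figure in perspective, the prompt count, prompt length, and activation density reported above can be combined into a rough token estimate. This is only a back-of-the-envelope sketch: it assumes the density was measured over the same 238,145 prompts of 512 tokens each, which the dashboard does not state, and the function names are hypothetical.

```python
# Figures copied from the dashboard configuration.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
ACTIVATION_DENSITY = 0.005 / 100  # 0.005% expressed as a fraction


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total tokens in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(n_tokens: int, density: float) -> float:
    """Tokens on which the feature fires.

    Assumption: the density applies to this same prompt set.
    """
    return n_tokens * density


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY)

print(f"total tokens: {total:,}")
print(f"approx. tokens with nonzero activation: {active:,.0f}")
```

Under that assumption the feature fires on roughly six thousand of about 122 million tokens, which is consistent with a sparse, narrowly scoped feature.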