INDEX

Explanations

moral justification or progress

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

вары

0.54

Like

0.47

ᅨ

0.47

ᅯ

0.46

ոն

0.45

virtual

0.45

Static

0.43

urity

0.41

া

0.41

er

0.41

POSITIVE LOGITS

 Bugünkü

0.47

dà

0.46

 precarious

0.45

帼

0.44

 incó

0.43

 بی

0.43

 البدايه

0.43

 uncomfortable

0.42

 شويه

0.42

ductor

0.41

Activations Density 0.000%