Explanations

words whispered in hushed tones

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each
Dataset (Dashboard): lmsys + oasst1

Negative Logits

 affords	0.44
 pilih	0.42
 ensures	0.41
 operates	0.40
 mitigate	0.39
犾	0.39
 provides	0.39
 dipilih	0.38
 team	0.38
arry	0.37

Positive Logits

 cyberpunk	0.49
 பரபர	0.48
ﺸ	0.47
("[	0.46
Hidden	0.46
Retour	0.46
 scandalous	0.45
 indecent	0.45
 sogenannte	0.44
 обновления	0.44

Activations Density 0.001%