INDEX

Explanations

sexual practices and moral reasoning

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

。",

0.37

Depos

0.36

Formation

0.36

 incarcer

0.35

ิติ

0.34

...."

0.34

Spending

0.34

ᅦ

0.34

﹙

0.34

 incarcerated

0.34

POSITIVE LOGITS

 belirli

0.50

 berpeng

0.46

 detalhes

0.46

 Pong

0.45

rollback

0.44

詳しくは

0.43

 Espíritu

0.43

 यूरोप

0.42

க

0.41

 अनि

0.41

Activations Density 0.004%