INDEX

Explanations

sexual objectification and exploitation

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

<b>

0.40

key

0.38

key

0.37

ometimes

0.37

 odpowied

0.37

 சில

0.36

ṣ

0.36

POSITIVE LOGITS

 এতটা

0.85

这么

0.84

 столь

0.83

 такого

0.83

如此

0.82

 ilyen

0.80

 such

0.79

這麼

0.78

 इतनी

0.77

这么多

0.74

Activations Density 0.532%