causing negative outcomes

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 کوي

0.70

یی

0.68

ي

0.65

rógeno

0.63

ചര്യ

0.63

 možnost

0.62

ți

0.61

 combustível

0.61

 নিয়ন্ত্র

0.60

 возможностей

0.60

POSITIVE LOGITS

0.81

0.76

0.75

0.66

0.60

st

0.59

 havoc

0.59

Activations Density 0.047%