INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 ставкалары

0.75

irting

0.72

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.72

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.71

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.70

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.70

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.69

↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵

0.69

Dod

0.69

 방법을

0.68

POSITIVE LOGITS

 increase

1.00

 focused

0.92

 excessive

0.84

 uplifting

0.81

 rocket

0.78

 boost

0.77

 optimistic

0.75

 invested

0.75

 increasing

0.74

 fitness

0.74

Activations Density 0.000%

No Known Activations