INDEX

Explanations

code, script, explanation, or technical documentation

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ură

0.44

请

0.44

 Einheit

0.42

Decade

0.41

 Supply

0.41

azers

0.41

 unité

0.40

 OPERATIONS

0.40

ază

0.40

 단위

0.40

POSITIVE LOGITS

 downward

0.39

 persuaded

0.37

 pepperoni

0.36

رخ

0.35

incl

0.33

裄

0.33

下げ

0.33

 glow

0.33

悠

0.32

 کوتا

0.32

Activations Density 0.004%