INDEX

Explanations

historically damaging or generally safer

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Negative Logits

 posibil    0.93
 Kunststoff    0.91
 molécules    0.84
 zaken    0.84
 molecules    0.81
 magyar    0.80
烣    0.80
 causas    0.80
 приводит    0.79
 toma    0.78

Positive Logits

 उल्लेख    0.77
spike    0.70
bord    0.70
冲    0.69
də    0.67
 വെ    0.67
 منسلک    0.66
mention    0.66
dom    0.65
 प्रांत    0.65

Activations Density 0.000%