INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

or

0.72

を探

0.68

 explosives

0.66

 расходы

0.66

热

0.64

想要的

0.64

 durg

0.64

などは

0.63

 scorching

0.63

 invoices

0.63

POSITIVE LOGITS

Identity

0.80

 identity

0.78

identity

0.75

 identidad

0.74

ందిన

0.73

absence

0.73

RequestId

0.73

 завжди

0.72

 सामान्य

0.70

ن

0.70

Activations Density 0.000%

No Known Activations