INDEX

Explanations

healthy, responsible, or coherent ways

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 задачи

0.44

tasks

0.42

幾乎

0.40

 거의

0.39

/*

0.37

 gelegt

0.37

 భారీ

0.37

几乎

0.36

 بسیار

0.36

capabilities

0.36

POSITIVE LOGITS

 reliable

0.73

 طریقے

0.69

 efficient

0.58

way

0.58

 enjoyable

0.56

 तरीका

0.55

 manier

0.55

有效的

0.55

 şekilde

0.55

 dependable

0.54

Activations Density 0.032%