INDEX

Explanations

mathematical implication and consequence

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Eighty

0.67

 Seventy

0.67

 líng

0.66

ज्ञात

0.66

 Fifty

0.64

채

0.63

jawab

0.62

ூல்

0.62

measured

0.61

둑

0.61

POSITIVE LOGITS

 implies

0.92

=>

0.91

=>

0.90

implies

0.87

 donc

0.87

Rightarrow

0.86

impl

0.85

 implica

0.83

 nên

0.82

Impl

0.81

Activations Density 0.026%