INDEX

Explanations

avoiding specific mentions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 වශ

1.06

 truyện

1.05

ட்சத்திர

1.04

 konular

1.04

 vlad

1.03

 Grundlagen

1.03

 ciencia

1.03

 וה

1.02

⭑

1.00

 vardı

0.99

POSITIVE LOGITS

 only

0.98

 essentially

0.96

 avoids

0.87

 seemingly

0.86

isn

0.85

 county

0.83

 ഒരു

0.83

 avoid

0.83

 doesn

0.82

 seems

0.81

Activations Density 0.002%