INDEX

Explanations

idiomatic punctuation and non-Latin scripts

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 numerous

0.65

 observable

0.65

 eigenvectors

0.63

 bequest

0.63

듕

0.62

 demolished

0.62

८

0.62

𝗥

0.61

 dynamically

0.60

ἆ

0.60

POSITIVE LOGITS

ર

0.90

ر

0.89

ل

0.84

м

0.80

ம்

0.79

ной

0.79

ת

0.79

ند

0.76

けた

0.72

ین

0.71

Activations Density 0.258%