INDEX

Explanations

principles

The neuron fires on explicit mentions of “rule” (often numbered or named, e.g. legal rules, technical/right-hand rules, etc.).

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 convoc

0.73

 editorials

0.69

শিদ

0.68

 رجسٹ

0.67

Aye

0.66

 edecek

0.66

𝙥

0.66

 Ander

0.65

 tasar

0.64

嬂

0.64

POSITIVE LOGITS

 법칙

0.96

 adage

0.90

 Principle

0.89

定律

0.88

 principle

0.86

 theorem

0.86

 hypothesis

0.84

 criterion

0.82

 Theory

0.82

applicable

0.80

Activations Density 0.318%