INDEX

Explanations

words about enhanced abilities, aspirations, or fantastical beings

The neuron detects lofty, self-empowering identity nouns or titles—words like “genius,” “superhero,” “goddess,” and “royalty.”

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 other

0.46

 groups

0.45

 extensive

0.44

 changes

0.44

 strenuous

0.43

 mechanisms

0.42

 categories

0.42

 parts

0.42

 periods

0.42

 images

0.41

POSITIVE LOGITS

ここに

0.56

Aquí

0.53

นี่

0.52

！」

0.51

这款

0.51

こんな

0.50

Haha

0.50

😏

0.49

cuando

0.49

Pharmaceutical

0.49

Activations Density 0.000%