INDEX

Explanations

Internet chat punctuation

np_max-act · gemini-2.0-flash

The neuron is triggered by the model’s internal header‐delimiter tokens that mark the beginning of a new speaker or system block (e.g. `<|start_header_id|>`).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

tokens that begin assistant responses (especially greeting/intro phrases like "Hello! How can I help you today?").

oai_token-act-pair · gpt-5-mini Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 resource

-0.08

Omega

-0.07

imbabwe

-0.07

γχ

-0.07

 ideal

-0.07

ivery

-0.06

Ch

-0.06

 Filter

-0.06

(compare

-0.06

ipa

-0.06

POSITIVE LOGITS

父亲

0.07

 попыт

0.07

 jTextField

0.07

 περί

0.06

htm

0.06

 pupper

0.06

ництво

0.06

=%.

0.06

 JMenuItem

0.06

 responder

0.06

Activations Density 0.028%

Internet chat punctuation

The neuron is triggered by the model’s internal header‐delimiter tokens that mark the beginning of a new speaker or system block (e.g. `<|start_header_id|>`).

tokens that begin assistant responses (especially greeting/intro phrases like "Hello! How can I help you today?").

No Comments

No Known Activations

Internet chat punctuation

The neuron is triggered by the model’s internal header‐delimiter tokens that mark the beginning of a new speaker or system block (e.g. `<|start_header_id|>`).

tokens that begin assistant responses (especially greeting/intro phrases like "Hello! How can I help you today?").

No Comments

No Known Activations