INDEX

Explanations

The neuron is sensitive to step‐by‐step transition cues—words like “next,” “previous,” “continuing,” and similar that signal the progression of sequential reasoning.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ême

-0.08

 Hoffman

-0.07

occan

-0.07

 Expand

-0.07

_nb

-0.07

Dia

-0.07

 harus

-0.06

紙

-0.06

 lamps

-0.06

.si

-0.06

POSITIVE LOGITS

(CONFIG

0.07

 Shaw

0.07

_release

0.06

 результат

0.06

 compromises

0.06

 tract

0.06

 Rocky

0.06

 Lindsey

0.06

 footing

0.06

 Shiv

0.06

Activations Density 0.012%

next

The neuron is sensitive to step‐by‐step transition cues—words like “next,” “previous,” “continuing,” and similar that signal the progression of sequential reasoning.

No Comments

No Known Activations