INDEX

Explanations

start begin

np_max-act · gemini-2.0-flash

This neuron detects Portuguese instructional cues—especially the verb “começar” (and its token fragments) marking the start of a procedure or step.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

部

-0.06

nten

-0.06

 opat

-0.06

อค

-0.06

 گذاری

-0.06

์ค

-0.06

参考

-0.06

чної

-0.06

 kayb

-0.06

申博

-0.06

POSITIVE LOGITS

 architecture

0.06

 Jeff

0.06

 раді

0.06

/community

0.06

 plán

0.06

uffle

0.06

 accompl

0.06

 imagine

0.06

Git

0.06

";

0.06

Activations Density 0.009%

start begin

This neuron detects Portuguese instructional cues—especially the verb “começar” (and its token fragments) marking the start of a procedure or step.

No Comments

No Known Activations

start begin

This neuron detects Portuguese instructional cues—especially the verb “começar” (and its token fragments) marking the start of a procedure or step.

No Comments

No Known Activations