Explanations

np_max-act · gemini-2.0-flash

The neuron detects figure subpanel labels (e.g. the “A,” “B,” etc. in “Fig. 1A,” “Fig. 1B,” etc.).
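To make the explanation above concrete, here is a minimal sketch of the kind of text it describes. The regular expression and the `find_panel_labels` helper are hypothetical illustrations of the "Fig. 1A" pattern, not the feature's actual mechanism, and the sample sentence is invented.

```python
import re

# Hypothetical pattern: "Fig", optional "." or "ure", a figure number,
# then a single panel letter (e.g. "Fig. 1A", "Figure 2c").
PANEL_LABEL = re.compile(r"\bFig(?:ure|\.)?\s*(\d+)([A-Za-z])\b")


def find_panel_labels(text):
    """Return (figure_number, panel_letter) pairs found in text."""
    return [(int(m.group(1)), m.group(2)) for m in PANEL_LABEL.finditer(text)]


# Invented sample text, used only for illustration.
sample = "Cells were imaged (Fig. 1A) and quantified (Fig. 1B); see also Figure 2c."
labels = find_panel_labels(sample)
print(labels)
```

A reference to a whole figure with no panel letter, such as "Fig. 3", is deliberately not matched, since the explanation is about the subpanel letter itself.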

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1
Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"_FALL": -0.08
".Cons": -0.07
"Seen": -0.07
"=datetime": -0.07
" pisc": -0.07
"Values": -0.07
".owl": -0.06
".Age": -0.06
"esterday": -0.06
"_LT": -0.06

Positive Logits

"числ": 0.07
" intermedi": 0.07
" الز": 0.06
"containers": 0.06
" استرات": 0.06
" Supplementary": 0.06
" supplementary": 0.06
" ozone": 0.06
"ulla": 0.06
" UIColor": 0.06

Activations Density 0.002%
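As a rough sense of scale for the density figure above, the sketch below assumes that activation density means the fraction of token positions on which the feature has a nonzero activation. That definition is an assumption, and the `activation_density` helper and the toy activation list are hypothetical.

```python
def activation_density(activations):
    """Fraction of token positions with a strictly positive activation."""
    if not activations:
        return 0.0
    return sum(1 for a in activations if a > 0) / len(activations)


# Toy data (invented): the feature fires on 2 of 100,000 token positions.
toy = [0.0] * 99_998 + [1.3, 0.7]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.002%
```

Under that assumption, a density of 0.002% corresponds to the feature firing on roughly 2 tokens in every 100,000.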

No Known Activations
