
Explanations

The

np_max-act · gemini-2.0-flash

This neuron is detecting special control tokens that mark the end of a text segment (e.g. the “<|eot_id|>” delimiter).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8


Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
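As a minimal sketch, the configuration values listed above can be held in a small record and sanity-checked. The `SaeConfig` class and the `hook_layer` helper are hypothetical names introduced here for illustration; they are not part of any library. The only assumption is that the hook name follows the `blocks.<layer>.hook_<point>` pattern seen in the listing.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container for the values shown in the configuration panel."""
    sae_id: str
    hook_name: str
    n_features: int
    dtype: str
    context_size: int
    dataset: str


def hook_layer(hook_name: str) -> int:
    """Extract the block index from a name such as 'blocks.11.hook_resid_post'."""
    match = re.fullmatch(r"blocks\.(\d+)\.hook_\w+", hook_name)
    if match is None:
        raise ValueError(f"unrecognised hook name: {hook_name!r}")
    return int(match.group(1))


# Values copied from the configuration listing.
config = SaeConfig(
    sae_id="andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1",
    hook_name="blocks.11.hook_resid_post",
    n_features=131_072,
    dtype="float32",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

layer = hook_layer(config.hook_name)

# The SAE identifier also names the layer, so the two should agree.
layer_matches_id = f"resid_post_layer_{layer}" in config.sae_id
print(layer, layer_matches_id)
```

The check confirms that the hook name and the SAE identifier refer to the same layer, which is a cheap way to catch a mismatched configuration before loading anything.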


Negative Logits

" France": -0.06
"Italy": -0.06
"↘": -0.06
"<M": -0.06
" ثلاث": -0.06
" Congressional": -0.06
"โลย": -0.06
"MER": -0.06
"ประเภท": -0.06

Positive Logits

":-": 0.07
" sender": 0.07
" вит": 0.07
"اهرة": 0.06
"Ś": 0.06
"ůj": 0.06
" incent": 0.06
"ันด": 0.06
"host": 0.06
"income": 0.06

Activations Density 0.061%
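A density of 0.061% corresponds to about 6 active positions per 10,000 tokens. The sketch below assumes that density means the fraction of token positions where the feature's activation exceeds a threshold (zero here); the page itself does not define the term, and `activation_density` is a hypothetical helper.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of positions whose activation is strictly above the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Toy data (assumed): 10,000 token positions, 6 of which activate the feature.
toy_activations = [0.0] * 10_000
for position in (5, 1_200, 3_400, 5_600, 7_800, 9_999):
    toy_activations[position] = 1.5

density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints 0.060%
```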

The

This neuron is detecting special control tokens that mark the end of a text segment (e.g. the “<|eot_id|>” delimiter).

No Comments

No Known Activations
