INDEX

Explanations

commas and "and"

np_max-act · gemini-2.0-flash

The neuron detects mentions of language-learning skill terms (e.g. listening, reading, writing).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 condoms

-0.07

	State

-0.06

[]):

-0.06

_EC

-0.06

.pipeline

-0.06

 strings

-0.06

 PATCH

-0.06

диви

-0.06

 púb

-0.06

 město

-0.06

POSITIVE LOGITS

يكي

0.07

 separating

0.06

">↵

0.06

ilda

0.06

 fought

0.06

oundingBox

0.06

 frente

0.06

	glfw

0.06

 Senator

0.06

；

0.06

Activations Density 0.122%

commas and "and"

The neuron detects mentions of language-learning skill terms (e.g. listening, reading, writing).

No Comments

No Known Activations