Explanations

from

np_max-act · gemini-2.0-flash

The neuron fires on methodological terms describing the removal or extraction of a substance (e.g. “extraction,” “remove,” “from”) in scientific protocols.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1
Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

ーの  -0.07
ailles  -0.06
在线视频  -0.06
 arrayOf  -0.06
 pageable  -0.06
 sırasında  -0.06
 spark  -0.06
olecular  -0.06
[]){↵  -0.06
래  -0.06

Positive Logits

("-",  0.07
Dot  0.07
 liberalism  0.07
_checker  0.07
DET  0.07
 Curtis  0.07
 heraus  0.06
�  0.06
Misc  0.06
 Cena  0.06
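
As a rough illustration of where such logit lists can come from: dashboards of this kind commonly project a feature's decoder direction through the model's unembedding matrix and report the most negative and most positive entries. Whether this page uses exactly that procedure is an assumption. The function name `top_logit_effects`, the toy matrices, and the toy vocabulary below are all hypothetical and are not taken from the SAE above.

```python
import numpy as np

def top_logit_effects(decoder_dir, unembed, vocab, k=2):
    """Return the k most negative and k most positive logit effects.

    decoder_dir: (d_model,) feature decoder direction (hypothetical values).
    unembed:     (d_model, vocab_size) unembedding matrix (hypothetical values).
    """
    effects = decoder_dir @ unembed            # one logit effect per vocab token
    order = np.argsort(effects)                # ascending: most negative first
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return negative, positive

# Toy setup: d_model = 3, vocabulary of 5 made-up tokens.
vocab = ["a", "b", "c", "d", "e"]
decoder_dir = np.array([1.0, 0.0, -1.0])
unembed = np.array([
    [0.5, -0.2, 0.0,  0.1, 0.3],
    [0.9,  0.9, 0.9,  0.9, 0.9],   # ignored: decoder_dir is zero on this row
    [0.1,  0.4, 0.0, -0.2, 0.2],
])

negative, positive = top_logit_effects(decoder_dir, unembed, vocab)
print("negative:", negative)
print("positive:", positive)
```

In this toy case the direction pushes token "a" up the most and token "b" down the most. Small magnitudes such as the ±0.06 to ±0.07 values listed above would, under this reading, indicate a feature with only a weak direct effect on the output logits.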

Activations Density 0.015%
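
A minimal sketch of how a density figure like this could be computed, assuming "activations density" means the fraction of token positions on which the feature has a nonzero activation. The helper `activation_density` and the toy activation array are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import numpy as np

def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    activations = np.asarray(activations)
    return float(np.mean(activations > threshold))

# Hypothetical toy data: 20,000 token positions, 3 of which fire.
acts = np.zeros(20_000)
acts[[10, 500, 1999]] = [0.8, 1.2, 0.3]

density = activation_density(acts)
print(f"{density:.3%}")  # 3 / 20,000 = 0.015%
```

At a density of 0.015%, a feature fires on roughly 15 out of every 100,000 tokens, which is consistent with a sparse, narrowly scoped feature.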

No Known Activations
