INDEX

Explanations

code

np_max-act · gemini-2.0-flash

This neuron activates on code identifier tokens, especially PascalCase names and annotations (e.g. class, method, and attribute names).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

(GTK

-0.07

(domain

-0.07

oggles

-0.06

	image

-0.06

 유저

-0.06

 pulses

-0.06

fic

-0.06

(original

-0.06

(org

-0.06

 endeavor

-0.06

POSITIVE LOGITS

长

0.07

 Balanced

0.06

ATORS

0.06

 LIFE

0.06

 dangerous

0.06

lao

0.06

Creators

0.06

§

0.06

 araya

0.06

 Joint

0.06

Activations Density 0.036%

code

This neuron activates on code identifier tokens, especially PascalCase names and annotations (e.g. class, method, and attribute names).

No Comments

No Known Activations