INDEX

Explanations

(

np_max-act · gemini-2.0-flash

The neuron strongly activates on numeric literals and arithmetic expressions (e.g. numbers, decimal constants, and math operators).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 suchen

-0.07

 کوچ

-0.07

 framing

-0.07

 signIn

-0.07

 plated

-0.06

 případně

-0.06

Sah

-0.06

 garner

-0.06

bugs

-0.06

Kos

-0.06

POSITIVE LOGITS

raig

0.07

lbrakk

0.06

 Ohio

0.06

(completion

0.06

 Patch

0.06

 Fear

0.06

.filename

0.06

ManagerInterface

0.06

fore

0.06

airo

0.06

Activations Density 0.003%

(

The neuron strongly activates on numeric literals and arithmetic expressions (e.g. numbers, decimal constants, and math operators).

No Comments

No Known Activations

(

The neuron strongly activates on numeric literals and arithmetic expressions (e.g. numbers, decimal constants, and math operators).

No Comments

No Known Activations