Explanations

each

np_max-act · gemini-2.0-flash

This neuron detects references to the positions (first, second, third, etc.) of elements in tuple- or pair-structured data descriptions.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

.Bind  -0.07
.Z  -0.07
事  -0.07
Pal  -0.06
olves  -0.06
aju  -0.06
 tribute  -0.06
 endorsing  -0.06
rey  -0.06

Positive Logits

 JAXB  0.07
 mixin  0.06
花  0.06
(old  0.06
ฺ  0.06
 Шев  0.06
 جریان  0.06
 CallingConvention  0.06
 роки  0.06
 suic  0.06
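The page does not say how the logit lists are produced. A common approach, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and report the tokens with the most negative and most positive scores. The sketch below illustrates that idea on toy data; `project_to_logits`, `extreme_tokens`, and all numeric values are hypothetical and are not taken from this dashboard or from any library.

```python
def project_to_logits(decoder_vec, unembed_rows):
    """Dot the feature's decoder direction with each token's unembedding row."""
    return [sum(d * w for d, w in zip(decoder_vec, row)) for row in unembed_rows]


def extreme_tokens(vocab, scores, k=10):
    """Return the k most negative and k most positive (token, score) pairs."""
    ranked = sorted(zip(vocab, scores), key=lambda pair: pair[1])
    negative = ranked[:k]        # lowest scores first
    positive = ranked[::-1][:k]  # highest scores first
    return negative, positive


# Hypothetical toy setup: 4 tokens, 2-dimensional residual stream.
vocab = [" JAXB", " mixin", ".Bind", ".Z"]
unembed_rows = [[0.7, 0.0], [0.6, 0.1], [-0.7, 0.0], [-0.6, -0.1]]
decoder_vec = [0.1, 0.0]

scores = project_to_logits(decoder_vec, unembed_rows)
negative, positive = extreme_tokens(vocab, scores, k=2)
print("negative:", negative)
print("positive:", positive)
```

Under this reading, all listed values lying within about ±0.07 would suggest that the feature has only a weak direct effect on any single output token.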

Activations Density 0.021%
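The density figure is most naturally read as the fraction of token positions on which the feature fires. The page does not define it, so the sketch below assumes "active" means an activation strictly greater than zero; `activation_density` and the toy data are hypothetical, chosen only to illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 2 of them active.
toy = [0.0] * 9998 + [1.7, 0.4]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.020%
```

By the same arithmetic, a density of 0.021% would correspond to roughly 2 active positions per 10,000 tokens.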
