INDEX

Explanations

Coding datatypes/objects

np_max-act · gemini-2.0-flash

The neuron activates on occurrences of the word “list,” i.e. mentions of list objects or list types.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 телеф

-0.07

.lesson

-0.06

emperature

-0.06

_phy

-0.06

ोजन

-0.06

icates

-0.06

ушка

-0.06

evity

-0.06

_ap

-0.06

젝

-0.06

POSITIVE LOGITS

 parameter

0.07

 ADDRESS

0.07

 حمایت

0.07

 Γκ

0.07

 khá

0.06

argument

0.06

archs

0.06

CPR

0.06

�

0.06

 publishers

0.06

Activations Density 0.031%

Coding datatypes/objects

The neuron activates on occurrences of the word “list,” i.e. mentions of list objects or list types.

No Comments

No Known Activations

Coding datatypes/objects

The neuron activates on occurrences of the word “list,” i.e. mentions of list objects or list types.

No Comments

No Known Activations