INDEX

Explanations

working

np_max-act · gemini-2.0-flash

This neuron is highly responsive to the word “example,” especially in contexts offering a working or demo example.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

'image

-0.07

 이유

-0.07

 hombre

-0.07

Zuk

-0.07

 Kabul

-0.06

 viewer

-0.06

()]

-0.06

Caught

-0.06

()},

-0.06

 repetition

-0.06

POSITIVE LOGITS

(pb

0.07

 demo

0.07

_interfaces

0.07

 دول

0.07

 securely

0.07

知

0.07

목

0.07

\Routing

0.06

 canadian

0.06

vb

0.06

Activations Density 0.005%

working

This neuron is highly responsive to the word “example,” especially in contexts offering a working or demo example.

No Comments

No Known Activations

working

This neuron is highly responsive to the word “example,” especially in contexts offering a working or demo example.

No Comments

No Known Activations