INDEX

Explanations

cater

np_max-act · gemini-2.0-flash

The neuron specifically detects the phrase “cater to.”

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

емых

-0.07

(Method

-0.07

 valide

-0.07

 STATIC

-0.07

 puts

-0.06

 обязатель

-0.06

Hou

-0.06

INS

-0.06

 نیروی

-0.06

 bumps

-0.06

POSITIVE LOGITS

 cater

0.14

 catering

0.11

 Cater

0.11

)reader

0.07

 Attribution

0.07

 serving

0.06

water

0.06

met

0.06

 attribution

0.06

atever

0.06

Activations Density 0.002%

cater

The neuron specifically detects the phrase “cater to.”

No Comments

No Known Activations