Explanations

cross

np_max-act · gemini-2.0-flash

The neuron activates on words describing plant-breeding operations, especially terms like “cross,” “breeding,” “hybrid,” and “seed,” as well as similar words indicating crossing or pollination events.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token         Logit
" payment"    -0.07
"(pk"         -0.06
".xr"         -0.06
" enemies"    -0.06
" NEWS"       -0.06
"([]);↵↵"     -0.06
" Văn"        -0.06
"영상"        -0.06
" Razor"      -0.06
" 평균"       -0.06

Positive Logits

Token           Logit
"ového"         0.07
" italiano"     0.07
" Controlled"   0.06
" duplicated"   0.06
"ęp"            0.06
"auen"          0.06
" krás"         0.06
" Continental"  0.06
"Activated"     0.06
" sexle"        0.06

Activations Density 0.005%
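
To give a sense of scale for the density figure, the short Python sketch below converts it into an expected count per context window. It assumes that "Activations Density" means the fraction of tokens on which the feature has a nonzero activation, which the page itself does not state. The function name expected_active_tokens is hypothetical and used only for illustration. The two input values are copied from this page.

```python
# Values copied from the dashboard page.
DENSITY_PERCENT = 0.005  # "Activations Density 0.005%"
CONTEXT_SIZE = 1024      # "Context Size 1,024"


def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens with a nonzero activation among n_tokens.

    Assumption: density is the fraction of tokens on which the feature fires.
    """
    return density_percent / 100.0 * n_tokens


# Expected number of activating tokens in one full context window.
per_context = expected_active_tokens(DENSITY_PERCENT, CONTEXT_SIZE)

# Average number of tokens between activations (inverse of the fraction).
tokens_per_activation = 100.0 / DENSITY_PERCENT

print(f"expected activating tokens per context: {per_context:.4f}")
print(f"about one activating token every {tokens_per_activation:,.0f} tokens")
```

Under that assumption, the feature fires on roughly 0.05 tokens per 1,024-token context, or about once every 20,000 tokens, so most contexts would contain no activation at all.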

No Known Activations