INDEX

Explanations

blog posts

np_max-act · gemini-2.0-flash

This neuron fires on the headings or section titles (often including numbered labels or post‐style titles) that introduce new segments in a blog/review format.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

OL

-0.07

_OBJ

-0.07

 mask

-0.07

 simply

-0.07

oney

-0.06

 Wilson

-0.06

 bust

-0.06

 autoimmune

-0.06

 leases

-0.06

 indica

-0.06

POSITIVE LOGITS

////////////////////////////////////////////////////////////////////////////////////////////////

0.06

 товарів

0.06

 많은

0.06

ा।↵↵

0.06

 ctxt

0.06

')}}↵

0.06

ivariate

0.06

ΙΚΗΣ

0.06

InstantiationException

0.06

Reminder

0.06

Activations Density 0.240%

blog posts

This neuron fires on the headings or section titles (often including numbered labels or post‐style titles) that introduce new segments in a blog/review format.

No Comments

No Known Activations

blog posts

This neuron fires on the headings or section titles (often including numbered labels or post‐style titles) that introduce new segments in a blog/review format.

No Comments

No Known Activations